期刊文献+

基于互信息量的改进K-Modes聚类方法 被引量:3

下载PDF
导出
摘要 文章提出了一种基于互信息量的改进K-Modes聚类方法,采用样本互信息来刻画数据对象属性之间的相互关系。在此基础上提出了一种新的距离度量,该距离度量方法既考虑了对象某个属性值本身的不同,又考虑了对象其它属性对该属性值的影响,使之更符合实际问题情况。实验结果表明,聚类方法有效地提高了聚类精度。
作者 吴润秀
出处 《统计与决策》 CSSCI 北大核心 2012年第6期89-91,共3页 Statistics & Decision
基金 江西省自然科学基金资助项目(2007GZS1056)
  • 相关文献

参考文献7

  • 1Dino Ienco,Ruggero G.Pensa,Rosa Meo.Context-Based DistanceLearning for Categorical Data Clustering[Z].IDA 2009,LNCS 5772,2009. 被引量:1
  • 2Shyam Boriah,Varun Chandola,Vipin Kumar.Similarity Measures forCategorical Data:A Comparative Evaluation[EB/OL]@cs.umu.edu,1999. 被引量:1
  • 3Amir Ahmad,Lipika Dey.A K-mean Clustering Algorithm for MixedNumeric and Categorical Data[J].Data&Knowledge Engineering,2007,(63). 被引量:1
  • 4Michael K.Ng,Mark Junjie Li,Joshua Zhexue Huang,Zengyou He.Onthe Impact of Dissimilarity Measure in k-Modes Clustering Algorithm[J].Ieee Transactions on Pattern Analysis and Machine Intelligence,2007,29(3). 被引量:1
  • 5白亮,梁吉业,曹付元.基于粗糙集的改进K-Modes聚类算法[J].计算机科学,2009,36(1):162-164. 被引量:15
  • 6张小宇,梁吉业,曹付元,于慧娟.基于加权连接度的改进K-Modes聚类算法[J].广西师范大学学报(自然科学版),2008,26(3):189-193. 被引量:3
  • 7Amir Ahmad,Lipika Dey.A Method to Compute Distance betweenTwo Categorical Values of Same Attribute in Unsupervised Learningfor Categorical Data Set[J].Pattern Recognition Letters,2007,(28). 被引量:1

二级参考文献29

  • 1张敏,于剑.基于划分的模糊聚类算法[J].软件学报,2004,15(6):858-868. 被引量:176
  • 2Han Jiawei,Kamber M. Data Mining:Concepts and Techniques. San Francisco, US: Morgan Kaufmann, 2001 被引量:1
  • 3MacQueen J B. Some methods for classification and analysis of multivariate observation//Proceeding 5^th Berkley Symposium, on Mathematical Statistics and Probability. 1967, I:281-297. University of California Press, 1967, Xvii, 666 被引量:1
  • 4Huang Zhexue. Clustering Large Data Sets with Mixed Numeric and Categorical Values//PAKDD'97. Singapore, World Scientific, 1997:21-35 被引量:1
  • 5Huang Zhexue. Extensions to the k Means algorithm for clustering large data sets with categorical values. Data Mining and Knowledge Discovery, 1998,2 : 283-304 被引量:1
  • 6Michael K, Ng M, Li Junjie, et al. On the impact of dissimilarity measure in K-Modes clustering algorithm. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2007,29 (3) : 503-507 被引量:1
  • 7Li Cen, Biswas Gautam. Unsupervised learning with mixed numeric and nominal data. IEEE Transactions on Knowledge and Data Engineering, 2002,14 :673-690 被引量:1
  • 8Hsu C C, Chen Chinlong, Su Yuwei. Hierarchical clustering of mixed data based on distance hierarchy. Information Sciences, 2007 :4474-4492 被引量:1
  • 9Hsu C C. Generalizing self-organizing map for categorical data. IEEE Transaction on Neural Network, 2006,17 (2) : 294-304 被引量:1
  • 10Ganti V, Ramakrishnanz J G R. CACTUS, clustering categorical data using summaries//Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining. San Diego:ACM Press, 1999 : 73-83 被引量:1

共引文献15

同被引文献17

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部