
Cluster Algorithm Based on Edge Density Distance (cited by: 6)
Abstract: Traditional grid-based clustering algorithms suffer from low clustering quality, while density-based clustering algorithms have high time complexity. To address the shortcomings of both, this paper combines their ideas in a new clustering algorithm. The algorithm introduces the edge density distance as a new density measure and, on that basis, defines a cluster and the clustering process. It first partitions the data into a grid and records initial statistics about the data for the subsequent k-nearest-neighbor computation. When searching for cluster centers it uses a bucket-sort strategy, so the next center can be selected quickly. Clustering then proceeds by iteratively searching the not-yet-examined k nearest neighbors of the current cluster and checking whether they are density-reachable. Theoretical analysis and experimental results show that the algorithm maintains high clustering accuracy while achieving low, nearly linear time complexity.
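The abstract names two speed-ups without giving their details: a grid that supports k-nearest-neighbor lookups, and a bucket-sort strategy for picking the next cluster center. The sketch below illustrates those two ideas only, under loud assumptions. The paper's edge density distance is not defined in the abstract, so the score used here (inverse mean k-NN distance) is a stand-in, and every function name and parameter (`build_grid`, `knn`, `density_scores`, `centre_order`, `cell`, `n_buckets`) is hypothetical rather than taken from the paper. The density-reachability expansion step is not covered.

```python
import math
from collections import defaultdict
from itertools import product


def build_grid(points, cell):
    """Map each grid cell (a tuple of ints) to the indices of the points in it."""
    grid = defaultdict(list)
    for i, p in enumerate(points):
        grid[tuple(math.floor(c / cell) for c in p)].append(i)
    return grid


def knn(points, grid, cell, i, k):
    """Return the k nearest neighbours of point i as sorted (distance, index) pairs.

    Cells are searched in growing square blocks around the home cell. The
    search stops once the k-th candidate is no farther than r * cell, because
    every point outside the searched block is strictly farther than that.
    """
    home = tuple(math.floor(c / cell) for c in points[i])
    k = min(k, len(points) - 1)
    if k <= 0:
        return []
    r = 0
    while True:
        r += 1
        cands = []
        for off in product(range(-r, r + 1), repeat=len(home)):
            key = tuple(h + o for h, o in zip(home, off))
            for j in grid.get(key, ()):
                if j != i:
                    cands.append((math.dist(points[i], points[j]), j))
        cands.sort()
        seen_all = len(cands) == len(points) - 1
        if seen_all or (len(cands) >= k and cands[k - 1][0] <= r * cell):
            return cands[:k]


def density_scores(points, cell, k):
    """Stand-in density (NOT the paper's measure): inverse mean k-NN distance."""
    grid = build_grid(points, cell)
    scores = []
    for i in range(len(points)):
        nbrs = knn(points, grid, cell, i, k)
        mean_d = sum(d for d, _ in nbrs) / max(len(nbrs), 1)
        scores.append(1.0 / (mean_d + 1e-12))
    return scores


def centre_order(scores, n_buckets=16):
    """Yield point indices from the densest bucket down (bucket-sort style)."""
    lo, hi = min(scores), max(scores)
    span = (hi - lo) or 1.0
    buckets = [[] for _ in range(n_buckets)]
    for i, s in enumerate(scores):
        b = min(int((s - lo) / span * n_buckets), n_buckets - 1)
        buckets[b].append(i)
    for bucket in reversed(buckets):
        yield from bucket


# Four tightly packed points plus two isolated ones.
pts = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0), (9.0, 0.0)]
scores = density_scores(pts, cell=1.0, k=2)
order = list(centre_order(scores))
```

Bucketing replaces a full comparison sort with a single linear pass, which is one plausible way to keep center selection cheap; the order inside a bucket is arbitrary, so this only approximates a strict density ranking.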
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2014, No. 8, pp. 245-249 (5 pages)
Funding: Supported by the Zhejiang Provincial Key Science and Technology Innovation Team Project (2010R50009)
Keywords: clustering, grid, density, Caed, DBSCAN, K-means

