k-means clustering framework based on semi-supervised learning

Cited by: 3

Abstract: The ordinary k-means algorithm has difficulty detecting the cluster structure of data sets whose points are locally sparse and unevenly distributed, or whose clusters are irregular in shape and size. To overcome this shortcoming, and guided by the idea of semi-supervised learning, a k-means clustering framework based on semi-supervised learning is proposed for a small labeled sample and a large unlabeled sample that share the same distribution in a hybrid-attribute space. Simulation experiments show that the framework often achieves better clustering accuracy than k-means, which indicates that the semi-supervised learning framework is effective to some extent.

Authors: Chen Xinquan (陈新泉), Su Jindian (苏锦钿)

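Affiliations: Web Sciences Center, University of Electronic Science and Technology of China; School of Computer Science and Engineering, South China University of Technology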

Source: Journal of Guangxi University (Natural Science Edition) (《广西大学学报（自然科学版）》), 2014, No. 5, pp. 1074-1082 (9 pages). Indexed in: CAS; Peking University Core Journals (北大核心).

Funding: National Natural Science Foundation of China (61103038)

Keywords: semi-supervised learning; hybrid attributes; k-means clustering; attributive measure

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
