摘要
针对传统模糊C-均值(FCM)聚类算法初始聚类中心不确定,且需要人为预先设定聚类类别数,从而导致结果不准确的问题,提出了一种基于中点密度函数的模糊聚类算法。首先,结合逐步回归思想作为初始聚类中心选取的方法,避免收敛结果陷入局部循环;其次,确定可能的聚类类别数目;最后,对结果进行重叠度和分离度的模糊聚类有效性指标判定,确定最佳的聚类类别数。实验证明该算法与原改进C-均值聚类算法相比,减少了迭代次数,平均准确率提高了12%。实验结果表明该算法能够减少聚类的处理时间,并在平均准确率和聚类性能指标上优于对比算法。
In the traditional Fuzzy C-Means( FCM) clustering algorithm, the initial clustering center is uncertain and the number of clusters should be preset in advance which may lead to inaccurate results. The fuzzy clustering algorithm based on midpoint density function was put forward. Firstly, the stepwise regression thought was integrated as the initial clustering center selection method to avoid convergence from local circulation, and then the number of clusters was determined, finally according to the results, the validity index of fuzzy clustering including overlap degree and resolution was judged to determin the optimal number of clusters. The results prove that, compared with the traditional improved FCM, the proposed algorithm reduces the number of iterations and increases the average accuracy by 12%. The experimental results show that the proposed algorithm can reduce the processing time of clustering, and it is better than the comparison algorithm on the average accuracy and the clustering performance index.
出处
《计算机应用》
CSCD
北大核心
2016年第1期150-153,170,共5页
journal of Computer Applications
基金
国家自然科学基金资助项目(61202100)~~
关键词
模糊C-均值
中点法
类集密度函数法
逐步回归思想
有效性指标
Fuzzy C-Means(FCM)
midpoint method
class set density function method
stepwise regression thought
validity index