Abstract
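This paper introduces the HowNet knowledge base to perform concept clustering of Chinese documents, improving the efficiency of text clustering analysis. It applies formal concept analysis to extract the topics of the Chinese text clusters produced by concept clustering and to analyze the associations between clusters, which improves the readability of the clustering results. Finally, two experiments evaluate the strengths and weaknesses of the clustering analysis and the cluster topic extraction method.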
This paper introduces the HowNet knowledge system into the study of concept-based Chinese text clustering. Experiments verified the good performance of the algorithm. It then applies formal concept analysis to the resulting text clusters to improve their readability. Finally, two experiments evaluate the merits and shortcomings of the method.
Author
庄世芳
ZHUANG Shi-fang (Quan Zhou Normal College, Quanzhou 362000, China)
Source
《电脑知识与技术》
2008, No. 4, pp. 138-140 (3 pages)
Computer Knowledge and Technology