摘要
利用词频分析、共词分析、聚类分析、多维尺度分析,绘制我国2005—2010年间文本聚类研究的知识图谱,得出领域研究结构,结合关键词粘合力,归纳出该领域四个类团研究群:相似度研究、向量空间模型、搜索引擎、Web文本挖掘。
Word-frequency analysis, Co-word analysis, together with Cluster analysis and Multi-dimen- sional analysis, are used in the paper to draw the mapping of knowledge of the Text clustering in China from the year of 2005 to 2010. Combining with key words adhesion method reveals the research structure of this field. The conclusion indicates that there are four groups in the research of text clustering,which is Similarity study, Vector Space Model, Search Engine, Web Text mining.
出处
《情报科学》
CSSCI
北大核心
2014年第3期23-27,共5页
Information Science
基金
广州市科技计划项目(2011J4300046)
关键词
文本聚类
知识图谱
共词分析
多元统计分析
text clustering
knowledge mapping
co-word analysis
multivariate statistical analysis