An Information-retrieval Algorithm Based on Classification and Key Phrase Extraction

Cited by: 10
Abstract: This paper proposes an information retrieval algorithm based on classification and key phrase extraction. The algorithm uses text classification and information extraction to assist retrieval, avoiding the excessive time complexity and low precision of vector space model algorithms. Because traditional information retrieval performance measures cannot effectively assess how well retrieved documents are ordered, the paper also introduces a new criterion, the ranking error rate, to evaluate the ranking of retrieval results. Experimental results show that, compared with the TF*IDF algorithm and the classification-based interactive retrieval algorithm, the proposed algorithm achieves faster query speed, higher precision, and a lower ranking error rate.
Source: Journal of System Simulation (《系统仿真学报》), indexed in CAS and CSCD, 2004, No. 5, pp. 1009-1012, 1016 (5 pages)
Funding: National Natural Science Foundation of China (60272051)
Keywords: vector space model; text classification; key phrase extraction; precision; ranking error rate
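
The abstract names TF*IDF retrieval in the vector space model as a baseline and precision as one of the evaluation measures. The following is a minimal sketch of such a baseline, not the paper's proposed algorithm: the function names (`tfidf_vectors`, `rank`, `precision_at_k`), the smoothed IDF variant, and the toy documents are all assumptions made for illustration. The paper's ranking error rate is not defined in the abstract, so it is not reproduced here.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (dicts) for a list of tokenized documents."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF; this particular variant is an assumption, others exist.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def rank(query, docs):
    """Return document indices ordered by decreasing similarity to the query."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms that never occur in the collection are ignored.
    q = {t: tf * idf[t] for t, tf in Counter(query).items() if t in idf}
    scores = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    # Ties are broken by document index so the ordering is deterministic.
    return [i for _, i in sorted(scores, key=lambda p: (-p[0], p[1]))]


def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k ranked documents that are relevant."""
    return sum(1 for i in ranking[:k] if i in relevant) / k


docs = [
    ["text", "classification", "svm"],
    ["key", "phrase", "extraction"],
    ["cooking", "recipes"],
]
ranking = rank(["key", "phrase"], docs)
print(ranking, precision_at_k(ranking, {1}, 1))
```

Note that `rank` scores every document in the collection against the query. Precision at a cutoff only counts how many relevant documents appear in the top k and ignores their relative order, which is the kind of gap a ranking-oriented measure is meant to address.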