文本检索的查询性能预测被引量：8

Predicting Query Performance for Text Retrieval

下载PDF

导出

摘要目前,查询性能预测(predicting query performance,简称PQP)已经被认为是检索系统最重要的功能之一.近几年的研究和实验表明,PQP技术在文本检索领域有着广阔的发展前景和拓展空间.对文本检索中的PQP进行综述,重点论述其主要方法和关键技术.首先介绍了常用的实验语料和评价体系;然后介绍了影响查询性能的各方面因素;之后,按照基于检索前和检索后的分类体系概述了目前主要的PQP方法;简介了PQP在几个方面的应用;最后讨论了PQP所面临的一些挑战. Predicting query performance （PQP） has recently been recognized by the IR （information retrieval） community as an important capability for IR systems. In recent years, research work carried out by many groups has confirmed that predicting query performance is a good method to figure out the robustness problem of the IR system and useful to give feedback to users, search engines and database creators. In this paper, the basic predicting query performance approaches for text retrieval are surveyed. The data for experiments and the methods for evaluation are introduced, the contributions of different factors to overall retrieval variability across queries are presented, the main PQP approaches are described from Pre-Retrieval to Post-Retrieval aspects, and some applications of PQP are presented. Finally, several primary challenges and open issues in PQP are summarized.

作者郎皓王斌李锦涛丁凡

机构地区中国科学院计算技术研究所

出处《软件学报》 EI CSCD 北大核心 2008年第2期291-300,共10页 Journal of Software

基金 Supported by the National Natural Science Foundation of China under Grant No.60603094 (国家自然科学基金) the National Basic Research Program of China under Grant No.2004CB318109 (国家重点基础研究发展计划(973)) the Beijing Science and Technology Planning Program of China under Grant No.D0106008040291 (北京市科技计划)

关键词信息检索查询性能预测 information retrieval query performance prediction

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献21

1Yates B, Neto R. Modern Information Retrieval. New York: ACM Press, 1999. 被引量：1
2Harman D, Buckley C. The NRRC reliable information access (RIA) workshop. In: Sanderson M, Jarveln K, Allan J, Bruza P, eds. Proc. of the 27th Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. Sheffield: ACM Press, 2004. 528-529. 被引量：1
3Voorhees EM. Overview of the TREC 2004 robust retrieval track. In: Online Proc. of the 2004 Text Retrieval Conf. (TREC 2004). 2004. http://trec.nist.gov/pubs/trec 13/t13_roceedings.html 被引量：1
4Yom-Tov E, Fine S, Carmel D, Darlow A. Learning to estimate query difficulty: Including applications to missing content detection and distributed information retrieval. In: Proc. of the 28th Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. Salvador: ACM Press, 2005.512-519. 被引量：1
5Vinay V, Cox I J, Milic-Frayling N, Wood K. On ranking the effectiveness of searches. In: Proc. of the 29th Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. New York: ACM Press, 2006. 398-404. 被引量：1
6Zhou Y, Croft WB. Ranking robustness: A novel framework to predict query performance. In: Proc. of the 15th ACM Int'l Conf. on Information and Knowledge Management. Arlington: ACM Press, 2006. 567-574. 被引量：1
7Cronen-Townsend S, Zhou Y, Croft WB. Predicting query performance. In: Proc. of the 25th Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. Tampere: ACM Press, 2002.299-306. 被引量：1
8Gibbons JD, Chakraborty S. Nonpararnetric Statistical Inference. 3rd ed., New York: Marcel Dekker, 1992. 被引量：1
9Kreyszig E. Advanced Engineering Mathematics. John Wiley & Sons, Inc., 1997. 被引量：1
10Carmel D, Yom-Tov E, Darlow A, Pelleg D. What makes a query difficult? In: Proc. of the 29th Annual Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. New York: ACM Press, 2006. 390-397. 被引量：1

同被引文献74

1马亮,陈群秀,蔡莲红.一种改进的自适应文本信息过滤模型[J].计算机研究与发展,2005,42(1):79-84. 被引量：18
2刘云峰,齐欢,代建民.潜在语义分析在中文信息处理中的应用[J].计算机工程与应用,2005,41(3):91-93. 被引量：18
3耿焕同,肖明军,邹翔,蔡庆生.聚类算法在范例库维护中的应用研究[J].计算机工程,2005,31(12):166-168. 被引量：10
4赵军,金千里,徐波.面向文本检索的语义计算[J].计算机学报,2005,28(12):2068-2078. 被引量：28
5丁国栋,白硕,王斌.文本检索的统计语言建模方法综述[J].计算机研究与发展,2006,43(5):769-776. 被引量：19
6付鸿鹄,张晓林.段落检索及其相关算法研究[J].现代图书情报技术,2007(2):39-43. 被引量：3
7郝春风,王忠民.一种用于大规模文本分类的特征表示方法[J].计算机工程与应用,2007,43(15):170-172. 被引量：12
8Schafer J B,Konstan J A,Riedl J.Recommender Systems in E-Conference[J].In ACM Conference on Electronic Commerce(EC99),1999. 被引量：1
9Mclvor R T,Humphreys P K.A Case-based Reasoning Approach to the Make or Buy Decision[J].Integrated Manufacturing Systems,2000,11(5):295-310. 被引量：1
10Leake D B.CBR in Context:the Present and Future[A].Case-based Reasoning,Experiences,Lessons&Future Directions[C].Menlo Park CA,USA:AAAI Press/the MIT Press,1996:1-30. 被引量：1

引证文献8

1徐嬴,刘屹,阴红志,崔斌.查询性能预测方法的性能评测研究(英文)[J].计算机研究与发展,2013,50(S1):70-79. 被引量：2
2李锴.基于查询性能预测的案例库维护策略[J].山西电子技术,2010(2):68-70. 被引量：1
3安俊秀.基于服务器集群的云检索系统的研究与示范[J].计算机科学,2010,37(7):179-182. 被引量：7
4吴世勇,王明文.基于聚类分析的搜索引擎自动性能评价[J].中文信息学报,2010,24(5):62-69. 被引量：2
5金小峰.一种大容量文本集的智能检索方法[J].计算机工程与应用,2011,47(7):143-145.
6陶永全.基于一种改进离散度的检索前查询性能预测[J].软件导刊,2015,14(9):37-39.
7刘泽林,钱仲焱,陈泳.基于概念图的民机概念设计要求识别与获取[J].计算机集成制造系统,2015,21(10):2549-2557. 被引量：3
8刘奕群.搜索引擎用户满意度评估[J].计算机研究与发展,2017,54(6):1133-1143. 被引量：5

二级引证文献20

1郝玉龙,孙阳,李冰.基于云计算的卫星地面应用系统设计[J].计算机应用与软件,2012,29(4):216-219. 被引量：7
2彭敏,杨铭,孙松涛,何炎祥.基于语句查询扩展和高性能计算平台的分布式信息检索系统DQSSQE[J].武汉大学学报（理学版）,2012,58(3):243-250.
3程芃森,安俊秀.基于特征词群的新闻类重复网页和近似网页识别算法[J].成都信息工程学院学报,2012,27(4):374-379.
4沈青,董波,肖德宝.基于服务器集群的云监控系统设计与实现[J].计算机工程与科学,2012,34(10):73-77. 被引量：16
5马志杰.我国搜索引擎评价研究的现状、问题及对策[J].图书馆学研究,2013(4):11-17. 被引量：9
6洪霞.云计算环境下绿色信息检索系统的研究初探[J].图书馆界,2014(1):1-4. 被引量：4
7高华.依托云服务系统的档案快速检索技术研究[J].四川档案,2015(2):31-32.
8丁杰,王继业,程志华.面向电力关系数据的云排序算法研究[J].计算机技术与发展,2015,25(7):5-10.
9陈福,林闯,薛超,徐月梅,孟坤,倪艺函.短句语义向量计算方法[J].通信学报,2016,37(2):11-19. 被引量：3
10石雁,李朝锋.基于协同相似计算的查询推荐[J].计算机工程,2016,42(8):188-193. 被引量：3

1陶永全.基于一种改进离散度的检索前查询性能预测[J].软件导刊,2015,14(9):37-39.
2陈苏海.基于VB的排序算法研究[J].电脑编程技巧与维护,2015(21):33-34.
3薛源海,俞晓明,刘悦,关峰,程学旗.基于查询性能预测的鲁棒检索排序研究[J].中文信息学报,2016,30(5):169-175.
4朱运航,邓明元.地区市电子政务网络总控中心体系概述[J].办公自动化,2007,0(2):14-16.
5乔亚男,齐勇.查询语义图辅助的信息检索性能预测模型[J].电子学报,2011,39(A03):158-162. 被引量：2
6徐嬴,刘屹,阴红志,崔斌.查询性能预测方法的性能评测研究(英文)[J].计算机研究与发展,2013,50(S1):70-79. 被引量：2
7盖庆书,白雪.基于神经网络模型的信息融合技术[J].华北水利水电学院学报,2009,30(1):67-69. 被引量：2
8豆瑞星.P2P的转机[J].互联网周刊,2011(7):40-43.
9黄锐.美国NIST信息安全风险管理体系概述[J].保密科学技术,2012(10):33-36.
10李锴.基于查询性能预测的案例库维护策略[J].山西电子技术,2010(2):68-70. 被引量：1

软件学报

2008年第2期

浏览历史

内容加载中请稍等...

文本检索的查询性能预测被引量：8

参考文献21

同被引文献74

引证文献8

二级引证文献20

相关作者

相关机构

相关主题

浏览历史

文本检索的查询性能预测 被引量：8

参考文献21

同被引文献74

引证文献8

二级引证文献20

相关作者

相关机构

相关主题

浏览历史

文本检索的查询性能预测被引量：8