期刊文献+

一种综合语义和时效性意图的检索结果多样化方法 被引量:7

Search Result Diversification Combing Semantic and Temporal Intent
下载PDF
导出
摘要 当前,检索结果多样化作为一种提升用户满意度的有效方法已成为Web和数据库检索、文本摘要及推荐系统等领域的研究热点之一.但已有研究工作大都只考虑语义多样化策略.而实际上,多样化是一个非常复杂的优化问题,还需考虑许多其他的策略,如新颖性、质量、价值等.众所周知,Web是一个动态的信息空间,用户的查询需求也随时间不断演化,只有在一个特定的时间模式下,检索系统才能返回满意的结果.故该文提出一种新的结合语义和时效性两个维度的查询结果多样化方法.该文首先给出了多维度查询结果多样化框架的通用定义.然后,对于给定的查询,探讨了如何基于文档、词和查询频率来计算其时效性意图的概率分布.之后,提出一种新的针对时效性多样化的评价方法.最后,构建了针对多维度多样化问题的真实数据集,并通过实验证明该文提出的方法,不管是在传统的多样化评价指标上,还是在该文提出的时效性多样化指标上,性能都超过了当前主流的基准方法. Result diversification has recently been an active research area aimed at improving user satisfaction in Web and database search,text summarization,as well as recommendation system.To the best of our knowledge,almost all existing work only takes semantic strategies into account.However,result diversity is a very complex optimization problem and there may be many other strategies to be considered,such as,freshness,quality,value and so on.Additionally,it is well known that the Web is a dynamic information space and many queries could only be answered accurately under a specific temporal pattern.In this paper we propose a novel multidimensional diversification framework which combines the temporal space and the semantic space together to generate diversified search results.Firstly,we give a formal definition of our multidimensional diversification framework.Then,we study how to compute the probability distribution of temporal intents directly based on document,word and query frequency data.And then,we present a new evaluation measure especially for temporal diversification.Finally,we construct a real-world dataset for multidimensional diversity problem.The experiments demonstrate that our method can outperform these baseline approaches significantly in terms of both popular diversified measures and a new measure proposed in this paper.
出处 《计算机学报》 EI CSCD 北大核心 2015年第10期2076-2091,共16页 Chinese Journal of Computers
基金 国家自然科学基金(61272240 61103151 61173068) 教育部博士点基金(20110131110028) 山东省自然科学基金(ZR2012FM037) 山东省优秀中青年科学家科研奖励基金(BS2012DX017)资助~~
关键词 多维度多样化 时效性意图 子主题 语义 时间 社交网络 社会计算 multidimensional diversity temporal intent subtopic semantic time social networks social computing
  • 相关文献

参考文献39

  • 1Raman K, Bennett P N, Collins-Thompson K. Toward whole-session relevance.. Exploring intrinsic diversity in Web search//Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2013:463-472. 被引量:1
  • 2Hong Dzung, Si Luo. Search result diversification in resource selection for federated search//Proceedings of the 36th International ACM SIGIR Conference on Researeh and Development in Information Retrieval. New York, USA, 2013:613-622. 被引量:1
  • 3Berberich K, Bedathur S. Temporal diversification of search results//Proceedings of the SIGIR 2013 Workshop on Time- Aware Information Access. Dublin, Ireland, 2013:101-105. 被引量:1
  • 4Dang V, Croft W B. Term level search result diversification //Proceedings of the 36 th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'13). New York, USA, 2013:603-612. 被引量:1
  • 5Kanhabua N, Nejdl W. Understanding the diversity of tweets in the time of outbreaks//Proceedings of the 22nd International Conference on World Wide Web Companion. Geneva, Switzerland, 2013:1335-1342. 被引量:1
  • 6Ren Zhaochun, Liang Shangsong, Meij E, de Rijke M. Personalized time-aware tweets summarization//Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2013:513-522. 被引量:1
  • 7Zhao G, Lee M L, Hsu W, et al. Increasing temporal diversity with purchase intervals//Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2012:165-174. 被引量:1
  • 8Lathia N, Hailes S, Capra L, Amatriain X. Temporal diversity in recommender systems//Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2010:210-217. 被引量:1
  • 9Aktolga E, Allan J. Sentiment diversification with different hiases//Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2013: 593-602. 被引量:1
  • 10McCreadie R, Macdonald C, Ounis I. News vertical search: When and what to display to users//Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA, 2013:253-262. 被引量:1

二级参考文献39

  • 1Goffman W.On relevance as a measure[J].Information Storage and Retrieval,1964,2 (3):201-203. 被引量:1
  • 2Bennett P N,Carterette B,Chapelle O,et al.Beyond binary relevance:preferences,diversity,and set-level judg-ments[J].ACM SIGIR Forum,2008,42(2):53-58. 被引量:1
  • 3Radlinski F,Carterette B,Bennett P N,et al.Redundancy,diversity and interdependent document relevance[J].ACM SIGIR Forum,2009,43(2):46-52. 被引量:1
  • 4Carbonell J,Goldstein J.The use of mmr,diversity-based reranking for reordering documents and producing summaries[C] //Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.New York:ACM,1998:335-336. 被引量:1
  • 5Zhai Chenxiang,Cohen W W,Lafferty J.Beyond independent relevance:methods and evaluation metrics for subtopic retrieval[C] //Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.New York:ACM,2003:10-17. 被引量:1
  • 6Gollapudi S,Sharma A.An axiomatic approach for result diversification[C] //Proceedings of the 18th International Conference on World Wide Web.New York:ACM,2009:381-390. 被引量:1
  • 7Zhu Xiaojin,Goldberg A B,Van Gael J,et al.Improving diversity in ranking using absorbing random walks[C] //Proceedings of Human Language Technologies:the Annual Conference of the North American Chapter of the Association for Computational Linguistics.Rochester:NAACL,2007:97-104. 被引量:1
  • 8Swaminathan A,Mathew C,Kirovski D.Essential pages[R].Redmond:Microsoft Research,2008. 被引量:1
  • 9Yue Y,Joachims T.Predicting diverse subsets using structural svms[C] //Proceedings of the 25th International Conference on Machine Learning.New York:ACM,2008:1224-1231. 被引量:1
  • 10Agrawal R,Gollapudi S,Halverson A,et al.Diversifying search results[C] // Proceedings of the Second ACM International Conference on Web Search and Data Mining.New York:ACM,2009:5-14. 被引量:1

共引文献7

同被引文献40

引证文献7

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部