期刊文献+

基于时间分布特征的博客突发事件检测 被引量:9

Blog Emergent Event Detection Based on Temporal Distribution
下载PDF
导出
摘要 博客是目前网络舆论的重要载体之一,如何自动检测博客中的突发事件对于舆情分析与疏导具有重要的研究价值。针对目前突发事件检测中存在的时间信息有歧义的虚假突发事件问题,本文提出了一种基于时间分布特征的博客突发事件检测方法。该方法通过波峰检测和计算事件文档与背景语料文档之间、事件相关文档和不相关文档之间的时间分布差异来判断该事件在时间特征上是否具有突发性和关联性。实验结果表明,该方法可有效检测博客中的突发事件并可有效去除时间信息有歧义的虚假突发事件。 Blog is one of the most important carriers for public opinions, and how to automatically detect emergent events of the blog has an important research value for analyzing and diverting public opinions. Because the false emergent events can be detected by ambiguous temporal information, this paper presents a blog emergent event detection method based on temporal distribution. This method can determine whether there are emergency and relevance between events and temporal information through the peak detection and calculating the difference of temporal distribution between the event documents and the background corpus documents, and between eventrelevant documents and eventirrelevant documents. The experimental results show that the method can effectively detect emergent events in the blog, and can effectively remove the false emergent events which have ambiguous temporal information.
出处 《计算机工程与科学》 CSCD 北大核心 2010年第10期145-149,共5页 Computer Engineering & Science
基金 国家自然科学基金资助项目(60873179) 深圳市科技计划基础研究资助项目(JC200903180630A) 高等学校博士学科点专项科研基金资助项目(20090121110032)
关键词 时间分布特征 KL距离 时间信息明确的事件 时间信息有歧义的事件 temporal distribution KullbackLeibler divergence temporally unambiguous event temporally ambiguous event
  • 相关文献

参考文献9

  • 1杨尔弘..突发事件信息提取研究[D].北京语言大学,2005:
  • 2Gruhl D, Nowell D L, Guha R, et al. Information Diffusion Through Blogspace[J]. ACM SIGKDD Explorations Newsletter, 2004,6(2) :43-52. 被引量:1
  • 3Qamra A, Tseng B, Chang E Y. Mining Blog Stories Using Community-Based and Temporal Clustering[C]//Proc of the 15th ACM Int'l Conf on Information and Knowledge Management, 2006 : 58-67. 被引量:1
  • 4Li Xiaoyan, Croft W B. Time-Based Language Models[C]// Proc of the 12th Int'l Conf on Information and Knowledge Mmanagement, 2003:469-475. 被引量:1
  • 5Diaz F, Jones R. Using Temporal Profiles of Queries for Precision Prediction[C]//Proc of the 27th Annual Int' l ACM SIGIR Conf on Research and Development in Information Retrieval, 2004:18-24. 被引量:1
  • 6Jones R, Diaz F. Temporal Profiles of Queries[J]. ACM Trans on Information Systems, 2007,25(3) : 14. 被引量:1
  • 7Steve C-T, Zhou Yun, Croft W B. Predicting Query Performance[C]//Proc of the 25th Annual Int'l ACM SIGIR Conf on Research and Development in Information Retrieval, 2002 : 299-306. 被引量:1
  • 8云健,刘勇奎,陈华,于洪志.AC和FKP融合算法在民族突发事件聚类分析中的应用[J].华中科技大学学报(社会科学版),2009,23(1):117-121. 被引量:2
  • 9Macdonald C, Ounis I. The TREC Blogs06 Collection : Creating and Analysing a Blog Test Collection[R]. Technical Report TR-2006-224, Department of Computer Science, University of Glasgow, 2006. 被引量:1

二级参考文献2

共引文献1

同被引文献206

引证文献9

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部