
A Method for Detecting Sudden Earthquake Events Based on Microblog Text Classification
Abstract: Event detection is an important research direction in text mining. This paper studies it in depth, taking the detection of sudden earthquake events in microblog text as a case study. First, three classical classification algorithms are each applied to sudden earthquake event detection, and their results are compared to select the best-performing algorithm and the most suitable number of features. On this basis, keyword filtering and temporal relation recognition are proposed to reclassify the misclassified instances and improve the detection results. Experiments show that the method improves the F1 value by 5.3% compared with using the classical classification algorithm alone.
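The abstract does not give the actual keyword lists, temporal rules, or classifier, so the following is only a minimal sketch of how a second-stage pass of this kind might look. The `Post` type, the cue lists `NON_EVENT_KEYWORDS` and `PAST_TIME_CUES`, and the choice to re-check only posts that the first-stage classifier labeled positive are all assumptions made for illustration, not the authors' method. The `f1_score` helper uses the standard definition of F1 as the harmonic mean of precision and recall.

```python
from dataclasses import dataclass

# Hypothetical cue lists: the source does not publish its keywords or temporal rules.
NON_EVENT_KEYWORDS = ("drill", "movie", "exercise")
PAST_TIME_CUES = ("years ago", "last year", "anniversary")


@dataclass
class Post:
    text: str
    predicted: int  # first-stage classifier output: 1 = sudden earthquake event, 0 = other


def refine(post: Post) -> int:
    """Re-check a positive prediction with keyword and temporal cues (assumed design)."""
    if post.predicted == 0:
        return 0  # this sketch only revisits posts labeled as events
    lowered = post.text.lower()
    # Keyword filtering: drop posts that mention earthquakes in a non-event context.
    if any(keyword in lowered for keyword in NON_EVENT_KEYWORDS):
        return 0
    # Temporal relation check: drop posts that refer to a past earthquake.
    if any(cue in lowered for cue in PAST_TIME_CUES):
        return 0
    return 1


def f1_score(gold: list[int], pred: list[int]) -> float:
    """F1 for the positive class: harmonic mean of precision and recall."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data, invented for this example only.
posts = [
    Post("Strong earthquake just hit, buildings are shaking", 1),
    Post("Earthquake drill at school today", 1),
    Post("Remembering the earthquake ten years ago", 1),
    Post("Nice weather this afternoon", 0),
]
gold = [1, 0, 0, 0]
first_stage = [p.predicted for p in posts]
second_stage = [refine(p) for p in posts]
```

Comparing `f1_score(gold, first_stage)` with `f1_score(gold, second_stage)` on labeled data is one way to measure whether such a filtering pass helps; the toy posts here are not drawn from the paper's data set.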
Source: Microcomputer & Its Applications (《微型机与应用》), 2017, No. 19, pp. 58-61, 65 (5 pages)
Keywords: event detection; text categorization; keyword filtering; temporal relation recognition











