期刊文献+

基于启发式错误驱动学习的中文时间表达式识别 被引量:3

Recognizing Chinese time expressions based on heuristic error-driven learning
下载PDF
导出
摘要 提出了一种基于启发式错误驱动学习的中文时间表达式识别的新方法。该方法先采用依存分析方法以时间触发词为切入点递归地识别时间表达式,有效地解决了长距离依赖的问题,大大提高了识别效果;在此基础上,对比错误识别结果和人工标注,采用启发式A*算法搜索策略进行错误驱动学习,降低了规则学习的复杂度,并具有区分每条规则的有效性和规则间相容性的优点,使系统性能提高近6%。最终在封闭测试集和开放测试集上,F值分别达到了77.96%和77.92%。 This paper proposes a new method tor recognizing Chinese time expression based on the heuristic error-driven learning. The method begins with a time trigger word to recognize the time expressions regressively using the dependency parsing, so it resolves the problem of long distance dependency effectively and improves the system performance greatly. Based on this, it uses the error-driven learning integrating the A^* algorithm to heuristically learn the rules, which not only decreases the complexity of learning rides, but also differentiates the validity of each rule and compatility among rules, resulting in an increase of 6% in system performance. Finally, it creats the F values of 77.96% and 77.92% on the closed test and the open test respectively.
出处 《高技术通讯》 EI CAS CSCD 北大核心 2008年第12期1258-1262,共5页 Chinese High Technology Letters
基金 863计划(2006AA01Z145) 国家自然科学基金(60435020 60675034)资助项目
关键词 时间表达式识别 时间触发词 依存分析 错误驱动学习 A*算法 time expression recognition, time trigger, dependency parsing, error-driven learning, A ^* algorithm
  • 相关文献

参考文献15

  • 1WuML, LiWJ, Lu Q, etal. CTEMP: A Chinese temporal parser for extracting and normalizing temporal Information. In: Proceeding of the International Joint Conference on Natural language Processing, Jeju Island, Korea, 2005. 694-706 被引量:1
  • 2Ye Y, Fossum V L, Abney S. Latent features in automatic tense translation between Chinese and English. in: Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing, Sydney, Australia, 2006.48-55 被引量:1
  • 3ACE2007 evaluation plan. http://projects. ldc. upenn. edu/ace/intro. html. 2006-11-6 被引量:1
  • 4SemEval-2007. http://nlp. cs. swarthmore.edu/semevaL/index. shtml. 2007-1 被引量:1
  • 5Jang S B, Baldwin J, Mind I. Automatic TIMEX2 tagging of Korean news. ACM Transaction on Asian Language Information processing,2004, 3 (1):51-65 被引量:1
  • 6Vazov N. A system for extraction of temporal expressions French Texts based on syntactic and semantic constraints. In: Proceedings d the Association for Computational Linguistics Workshop on Temporal and Spatial Information Processing, Toulouse, France, 2001. 96-103 被引量:1
  • 7Estela S, Martinez-Barco, Patricio, et al. Recognizing and tagging temporal expressions in Spanish. In: Proceedinss of the Workshop on Annotation Standards for Temporal Information in Natural Language, The International Conference on Language Resources and Evaluation, Las Palmas, Spain, 2002 被引量:1
  • 8Mani I. Recent developments in temporal information extraction. In: Proceedings d the Conference on Recent Advances in Natural Language Processing, Alicante, Spain, 2004 被引量:1
  • 9Hacioglu K, Chen Y, Douglas B. Automatic time exxon labeling for English and Chinese text. In: Proceedings d Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, 2005. 被引量:1
  • 10AhnD, Adahe S F, Rijke M de. Towards task-besed temporal extraction and recognition. In: Proceedings of Dagstuhl Workshop on Annotating, Extracting, and Reasoning about Tune and Events, Dagstuhl Castle, Germany, 2005 被引量:1

二级参考文献14

  • 1Mingli Wu,Wenjie Li,Qin Lu,Baoli Li.CTEMP:A Chinese Temporal Parser for Extracting and Normalizing Temporal Information[A].IJCNLP 2005[C].694-706. 被引量:1
  • 2Yang Ye,Victoria Li Fossum,and Steven Abney.Latent features in automatic tense translation between chinese and english[A].In:Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing[C].Sydney,Australia:July 2006.48-55. 被引量:1
  • 3SemEval-2007[EB/OL].http://nlp.cs.swarthmore.edu/semeval/index.shtml. 被引量:1
  • 4Vazov N.A System for Extraction of Temporal Expressions from French Texts based on Syntactic and Semantic Constraints[A].In:Proceedings of the ACL Workshop on Temporal and Spatial Information Processing (2001)[C].96-103. 被引量:1
  • 5Wilson,G.,Mani,I.,Sundheim,B.,and Ferro,L.2001.A multilingual approach to annotating and extracting temporal information[A].In:Proceedings of the Workshop for Temporal and Spatial Information Processing EACL-ACL 2001[C].Toulouse,France:July,2001. 被引量:1
  • 6SETZER,A.2001.Temporal information in newswire articles:An annotation scheme and corpus study[EB/OL].Ph.D.thesis,Univ.of Sheffield. 被引量:1
  • 7Mani,I.2004.Recent Developments in Temporal Information Extraction[A].In:NICOLOV,N.,AND MITKOV,K.,Proceedings of the Conference on Recent Advances In Natural Language Processing[C].John Benjamins. 被引量:1
  • 8ACE2007 evaluation plan[EB/OL].http://projects.ldc.upenn.edu/ace/intro.html 2006-11-6. 被引量:1
  • 9Jang,S.B.,Baldwin,J.and Mani,I.:Automatic TIMEX2 Tagging of Korean News[J].ACM Transactions on Asian Language Information processing 2004,3(1):51-65. 被引量:1
  • 10Estela,S.,Martinez-Barco,Patricio,and Munoz,R.:Recognizing and Tagging Temporal Expressions in Spanish[A].Workshop on Annotation Standards for Temporal Information in Natural Language,LREC 2002[C]. 被引量:1

共引文献20

同被引文献28

  • 1陈文亮,朱靖波,吕学强.词性标注规则的获取和优化[J].术语标准化与信息技术,2004(2):23-26. 被引量:5
  • 2孙景广,蔡东风,吕德新,董燕举.基于知网的中文问题自动分类[J].中文信息学报,2007,21(1):90-95. 被引量:41
  • 3Seok Bae Jang, Jennifer Baldwin. Inderjeet Mani Automatic TIMEX2 Tagging of Korean News [J].ACM Transactions on Asian Language Information processing (TALIP), 2004, 3(1) : 51-65. 被引量:1
  • 4Nikolai Vazov A System for Extraction of Temporal Expressions from French Texts based on Syntactic and Semantic Constraints[C]//Proceedings of the workshop on Temporal and spatial information processing, 2001, Volume 13: Article No. 14:1-8. 被引量:1
  • 5Estela Saquete, Patricio Martinez-barco. Rafael Mufioz Recognizing and Tagging Temporal Expressions in Spanish [C]//Workshop on Annotation Standards for Temporal Information in Natural Language (LREC), 2002: 44-51. 被引量:1
  • 6Mingli Wu, Wenjie Li, Qin Lu, Baoli Li. A Chinese Temporal Parser for Extracting and Normalizing Temporal Information [C]//International Joint Conference on Natural Language Processing ( IJCNLP), 2005, Volume 3651: 694-706. 被引量:1
  • 7David Ahn, Sisay Fissaha Adafre, Maarten De Rijke Towards Task-Based Temporal Extraction and Recognition [C]//Proceedings Dagstuhl Workshop on Annotating, Extracting, and Reasoning about Time and Events, 2005. 被引量:1
  • 8Kadri Hacioglu, Ying Chen. Benjamin Douglas Auto matic Time Expression Labeling for English and Chi nese Text [C]//Computational Linguistics and Intelli gent Text Processing (CICLing), 2005, Volume 3406 548-559. 被引量:1
  • 9贺瑞芳,秦兵,刘挺,潘越群,李生.基于依存分析和错误驱动的中文时间表达式识别[J].中文信息学报,2007,21(5):36-40. 被引量:21
  • 10LIU Xiao-ming, LIU Li. Question classification based on focus[C]// Proc of the 2012 International Conference on Communication Systems and Network Technologies. 2012:512-516. 被引量:1

引证文献3

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部