Chinese Term Recognition Based on Hidden Markov Model (基于隐马尔科夫模型的中文术语识别研究), cited by 37
Abstract: Based on an analysis of the probabilistic characteristics of the syntactic composition of Chinese text, especially part-of-speech (POS) collocation, this paper proposes an approach and system framework for recognizing and extracting Chinese general terms using a dual-layer hidden Markov model (HMM), and implements the corresponding system. Using a training corpus, term extraction was tested on texts from multiple domains. The results indicate that the proposed method performs well across domains and has practical reference value. The terms recognized and extracted by the system can be treated as candidate terms, to be filtered for false positives and further refined in combination with mutual information, log-likelihood, and domain dependency.
Source: New Technology of Library and Information Service (《现代图书情报技术》), CSSCI, PKU Core, 2008, No. 12, pp. 54-58 (5 pages)
Keywords: Chinese term recognition and extraction; hidden Markov model; HMM