基于层次模式匹配的命名实体识别模型 (cited by: 8)

Named Entity Extraction Model Based on Hierarchical Pattern Matching
Abstract: This paper focuses on the extraction and classification of Expression Named Entities (ENE) in unstructured Chinese text. It constructs a set of matching patterns and builds an ENE extraction model based on hierarchical pattern matching (HPM_ENE_EM), intended as a foundation for applied information-science research such as Competitive Intelligence Systems (CIS) and the acquisition of user interest degree. Finally, the paper discusses a concrete application of the model, using the recognition of term abbreviations in academic papers as an example.
Author: 王昊 (Wang Hao)
Source: 《现代图书情报技术》 (New Technology of Library and Information Service), indexed in CSSCI and the Peking University Core Journals list, 2007, No. 5, pp. 62-68 (7 pages)
Keywords: Expression named entity; Hierarchical pattern matching; Term extraction; Abbreviative terms