期刊文献+

Research on Chinese place name recognition based on kernel classifier

Research on Chinese place name recognition based on kernel classifier
下载PDF
导出
摘要 A SVMs (Support Vector Machines) based method to identify Chinese place names is presented. In our approach, place name candidate is located according to a rational forming assumption, then SVMs based identification strategy is used to distinguish whether one candidate is true place name or not. Referring to linguistic knowledge, basic semanteme of a contextual word and frequency information of words inside place name candidate are selected as features in our methodology. So dimension in the feature space is reduced dramatically and processing procedure is performed more efficiently. Result of open testing on unregistered place names achieves F-measure 83.25 in 8.17 million words news based on this project. A SVMs (Support Vector Machines) based method to identify Chinese place names is presented. In our approach, place name candidate is located according to a rational forming assumption, then SVMs based identification strategy is used to distinguish whether one candidate is true place name or not. Referring to linguistic knowledge, basic semanteme of a contextual word and frequency information of words inside place name candidate are selected as features in our methodology. So dimension in the feature space is reduced dramatically and processing procedure is performed more efficiently. Result of open testing on unregistered place names achieves F-measure 83.25 in 8. 17 million words news based on this project.
出处 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2007年第1期79-82,共4页 哈尔滨工业大学学报(英文版)
基金 Foundation of China(Grant No.60175020and60673037) and the National High Technology Research and Development Program of China (Grant No.2002AA117010-09).
关键词 SVMS Chinese place name feature selection semanteme kernel function 中国 地名 SVMs 特征选择 核函数 语义 模式识别 支持向量机
  • 相关文献

参考文献4

二级参考文献29

  • 1孙茂松,黄昌宁,高海燕,方捷.中文姓名的自动辨识[J].中文信息学报,1995,9(2):16-27. 被引量:87
  • 2谭红叶 郑家恒 等.中国地名的自动识别方法研究.计算语言学文集[M].北京:清华大学出版社,1999.. 被引量:1
  • 3谭红叶 郑家恒 等.基于变换的中国地名识别方法研究.第六届人工智能会议论文集[M].,2001.. 被引量:1
  • 4Tan Hongye,Proc Computational Linguistics,1999年,174页 被引量:1
  • 5中国地名委员会,中国地名录,1994年 被引量:1
  • 6E F T K Sang, W Daelemans, H Déjean et al. Applying system combination to base noun phrase identification. In: Proc of COLING 2000. Saarbrücken, Germany: Morgan Kaufmann Publishers, 2000. 857~863 被引量:1
  • 7周明 .基于语料库的中文最长名词短语的自动抽取.见:计算语言进展与应用.北京,清华大学出版社,1995. 50-55(Zhou Ming. Corpus-based Chinese maximum noun phrase extraction. In: Computer Linguistic Development and Application(in Chinese). Beijing: Tsinghua University Press, 1995. 50-55) 被引量:1
  • 8K W Church. A stochastic parts program and noun phrase for unrestricted test. In: Proc of the 2nd Conf on Applied Natural Language Processing. Austin, TX, USA: Kluwer Academic Publishers, 1988. 136~143 被引量:1
  • 9S P Abney. Parsing by Chunks. In: R C Berwick, S P Abney eds. PrincipleBased Parsing: Computation and Psycholinguistics. Boston, USA: Kluwer Academic Publishers, 1991. 257~278 被引量:1
  • 10L A Ramshaw, M P Marcus. Text chunking using transformation-based learning. In: Proc of the 3rd Workshop on Very Large Corpora. Kluwer Academic Publishers, 1995. 82~94 被引量:1

共引文献120

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部