期刊文献+

术语关系自动抽取方法研究 被引量:7

Study on Term Relation Extraction from Domain Text
下载PDF
导出
摘要 将术语关系抽取转化为分类问题,给出了基于机器学习的术语关系自动抽取流程。针对现有产生式和判定学习算法的缺点,提出了混合分类算法HC。该算法使得一部分特征值通过训练数据估计而来,另一部分特征值通过判定函数训练得到。实验结果表明,该算法优于原来的产生式学习算法和判断学习算法,在人工标注的小训练集上获得了较好的分类效果。 A term relation extraction approach was proposed. It was cast as a classification task. The hybrid classification algorithm combining the advantages of both naive bayes and perceptron was also presented. In this algorithm, a subset of the features was estimated from training data, and another subset of the features was trained by discriminative function. The experimental results showed that the proposed hybrid algorithm almost always outperforms the naive bayes algorithms and perceptron algorithms when the training set is small.
出处 《计算机科学》 CSCD 北大核心 2010年第2期189-191,215,共4页 Computer Science
基金 陕西省教育厅项目(09JK768 09JK774 09JK738)资助
关键词 机器学习 术语关系抽取 混合学习算法 Machine learning,Term relation extraction,Classification algorithm
  • 相关文献

参考文献11

  • 1Boguraev B, Kennedy C. Applications of term identification technology:domain description and content characterization[J] . Natural I.anguage Engineering, 1999,5 (1) : 17-44. 被引量:1
  • 2Appeit D E. Introduction to Information Extraction[J]. AI Communications, 1999,12(3) : 161- 172. 被引量:1
  • 3Aone C, Ramos M, Rees S. A large-scale relation and event extraction system[C] //Proceedings of the 6th Applied Natural Language Processing Conference. New York;ACM Press, 2000:76-83. 被引量:1
  • 4Hearst M A. Automatic Acquisition of Hyponyms from I.arge Text Corpora[C] //14^th International Conference on Computa tional Linguistics. Nantes, France, 1992 : 539-545. 被引量:1
  • 5Yu Hong, et al. Automatic Extraction of Gene and Protein Synonyms from MEDLINE and Journal Articles[C]// Proceedings of the American Medical Informatics Association 2002 Symposium (AMIA'2002). 2002:919-923. 被引量:1
  • 6刘克彬,李芳,刘磊,韩颖.基于核函数中文关系自动抽取系统的实现[J].计算机研究与发展,2007,44(8):1406-1411. 被引量:58
  • 7Li Wenjie, et al. A Novel Feature-based Approach to Chinese Entity Relation Extraction[C]//Proceedings of ACL-08: HLT. Columbus, USA, 2008 : 89-92. 被引量:1
  • 8Girju R, Badulescu A, Moldovan D. Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations [C]// Edmonton, Canada. Proceedings of HLT-NAACL. Edmonton, Canada, 2003 : 80-87. 被引量:1
  • 9Fleischman M, Hovy E. Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked[C]// Proceedings of the 41^st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan, 2003 : 1-7. 被引量:1
  • 10Chang J T. Using Machine Learning to Extract Drug and Gene relationships from Text[D]. Stanford University, 2003. 被引量:1

二级参考文献28

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2黄萱菁,吴立德,王文欣,叶丹瑾.基于机器学习的无需人工编制词典的切词系统[J].模式识别与人工智能,1996,9(4):297-303. 被引量:24
  • 3傅兴岭.现代汉语通用字典[M].汉语教学与研究出版社,1987.. 被引量:1
  • 4Ge Xian-ping, Wanda Pratt, Padhraic Smyth. Discovering Chinese words from unsegmented text[C]. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1999:271-272. 被引量:1
  • 5Chien Lee-feng. PAT-tree-based adaptive keyphrase extraction for intelligent Chinese information retrieval [A]. Information Processing and Magagement (IPM) [M]. Elsevier Press, 1999,35(4):501-521. 被引量:1
  • 6Christopher S G. Khoo Yubin Dai. Using statistical and contextual information to identify two-and three-character words in Chinese text [J]. Journal of the American Society for Information Science and Technology. 2002,53(5) :365-377. 被引量:1
  • 7Honglan Jin, Kam-Fai Wong. A Chinese dictionary construction algorithm for information retrieval [ EB/OL ]. 2002http://www. se. cuhk. edu. hk/dn/TALIP-02-a35. doc. 被引量:1
  • 8Tang Hai-jiang,Pascale Fung. A multi-path syllable to word decoder with language model optimization and automatic lexicon augmentation[J]. 2000 International Symposium on Chinese Spoken Language Processing,Beijing,China,Oct 2000. 被引量:1
  • 9刘群 李素建.基于《知网》的词汇语义相似度计算.中文计算语言学,2002,7(2):59-76. 被引量:147
  • 10C Aone,M Ramos Santacruz.Rees:A large-scale relation and event extraction system[C].In:Proc of the 6th Applied Natural Language Processing Conference.New York:ACM Press,2000.76-83. 被引量:1

共引文献67

同被引文献53

引证文献7

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部