期刊文献+

基于支持向量机的生物医学文献蛋白质关系抽取 被引量:20

Extraction of information on protein-protein interaction from biomedical literatures using an SVM
下载PDF
导出
摘要 从生物医学文献中抽取蛋白质(基因)交互作用关系对蛋白质知识网络的建立、蛋白质关系的预测以及新药的研制等均具有重要的意义.提出了一种基于支持向量机(SVM)的蛋白质(基因)交互作用关系抽取方法.该方法除了选取词项特征、关键词特征、实体距离特征、链接特征外,还利用链接语法分析方法可以获得较高准确率的特性,引入链接语法分析方法抽取结果特征.实验结果表明,该方法的召回率性能与使用同一测试语料的其他系统相比具有明显的优势,综合分类率F指标也高于其他系统. Automated extraction of protein-protein interaction information from biomedical literature is helpful when building a protein knowledge network, predicting protein functions and designing new drugs. This paper presents .a method for protein-protein interaction extraction from biomedical literature using a support vector machine (SVM).In this method, besides common index parameters such as word features, keyword features, entity distance features and link path features, a link grammar extraction feature is used to improve precision when identifying protein-protein interactions. Experimental results indicated that the recall rate and the F-score of this method are much higher than that of other extraction systems for the same dataset.
出处 《智能系统学报》 2008年第4期361-369,共9页 CAAI Transactions on Intelligent Systems
基金 国家自然科学基金资助项目(60373095,60673039) 国家“863”高科技计划资助项目(2006AA01Z151)
关键词 关系抽取 链接语法 支持向量机 interaction extraction link grammar support vector machine ( SVM )
  • 相关文献

参考文献23

  • 1[1]PUSTEJOVSKY J,CASTANO,ZHANG J.Robust relational parsing over biomedical literature:extracting inhibit relations[C]// Proceedings of the Seventh Pacific Symposium on Bio-Computing.[S.l.],2002:362-373. 被引量:1
  • 2[2]LEROY G,CHEN H,MARTINEZ J D.A shallow parser based on closed-class words to capture relations in biomedical text[J].Journal of Biomedical Informatics,2003,36(3):145-158. 被引量:1
  • 3[3]PARK J C,KIM H S,KIM J J.Bidirectional incremental parsing for automatic pathway identification with combinatory categorical grammar[C]// Proceedings of the Pacific Symposium on Bio-Computing.Hawaii,USA,2001:396-407. 被引量:1
  • 4[4]TEMKIN J M,GILDER M R.Extraction of protein interaction information from unstructured text using a context-free grammar[J].Bioinformatics,2003,19:2046-2053. 被引量:1
  • 5[5]AHMED S T,CHINDAMBARAM D,DAVULCU H,et al.IntEx:a syntactic role driven protein-protein interaction extractor for bio-medical text[C]// Proceeding of the ACL-ISMB Workshop on Linking Biological Literature,Ontologies and Databases:Mining Biological Semantics.Detroit,Michigan,USA,2005:54-61. 被引量:1
  • 6[6]ONO T,HISHIGAKI H,TANIGAMIi A,et al.Automatic extraction of information on protein-protein interactions from the biological literature[J].Bioinformatics,2001,17 (2):155-161. 被引量:1
  • 7[7]HUANG M L,ZHU X Y,HAO Y,et al.Discovering patterns to extract protein-protein interactions from full texts[J].Bioinformatics,2004,20 (18):3604-3612. 被引量:1
  • 8[8]DAVID C,BEMARD B,WILLIAM L,et al.BioRAT:extracting biological information from full-length papers[J].Bioinformatics,2004,20(17):3206-3213. 被引量:1
  • 9[9]ANDRADE M A,VALENICA A.Automatic extraction of keywords from scientific text:application to the knowledge domain of protein families[J].Bioinformatic,1998,14(7):600-607. 被引量:1
  • 10[10]CRAVEN M,KUMLIEN J.Constructing biological knowledge bases by extracting information from text sources[C]// Proceedings of the 7th International Conference on Intelligent Systems for Molecular Biology.Heidelberg,Germany,1999:77-86. 被引量:1

同被引文献212

引证文献20

二级引证文献56

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部