期刊文献+

Medical Entity and Attributes Extraction System Based on Relation Annotation 被引量:1

Medical Entity and Attributes Extraction System Based on Relation Annotation
原文传递
导出
摘要 The abundant entities and entity-attribute relations in medical websites are important data resources for medical research.However,the medical websites are usually characterized of storing entity and attribute values in different pages.To extract those data records efficiently,we propose an automatic extraction system which is related to entity and attribute relations(attributes and values)of separate storage.Our system includes following modules:(1)rich-information interactive annotation page rendering;(2)separate storage attribute relations annotating;(3)annotated relations for pattern generating and data records extracting.This paper presents the relations about the attributes which are stored in many pages by effective annotation,then generates rules for data records extraction.The experiments show that the system can not only complete attribute relations of separate storage extraction,but also be compatible with regular relation extraction,while maintaining high accuracy. The abundant entities and entity-attribute relations in medical websites are important data resources for medical research.However,the medical websites are usually characterized of storing entity and attribute values in different pages.To extract those data records efficiently,we propose an automatic extraction system which is related to entity and attribute relations(attributes and values)of separate storage.Our system includes following modules:(1)rich-information interactive annotation page rendering;(2)separate storage attribute relations annotating;(3)annotated relations for pattern generating and data records extracting.This paper presents the relations about the attributes which are stored in many pages by effective annotation,then generates rules for data records extraction.The experiments show that the system can not only complete attribute relations of separate storage extraction,but also be compatible with regular relation extraction,while maintaining high accuracy.
出处 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2016年第2期145-150,共6页 武汉大学学报(自然科学英文版)
基金 Supported by the Natural Science Foundation of Hubei Province(2013CFB334)
关键词 relation annotation information extraction medical data relation extraction relation annotation information extraction medical data relation extraction
  • 相关文献

参考文献13

  • 1Ferrara E, De Meo P, Fiumara G, et al. Web data extraction, applications and techniques: A survey [J], Knowledge-Based Systems, 2014, 70: 301-323. 被引量:1
  • 2Crescenzi V, Mecca G, Merialdo P. Roadrunner: Towards automatic data extraction from large Web sites [C]// Very Large Data Bases. Sam Framcisco: Morgan Kaufmann ,2001, 1: 109-118. 被引量:1
  • 3Myllymaki J, Jackson J. Robust Web Data Extraction with Xml Path Expressions [R], IBM Research Report, RJ10245. Sam Jose: IBM, 2002. 被引量:1
  • 4Zhang Y, Zhang Q, Jiang G. A DOM-based Anchor-Hop-T method for Web application information extraction [J], Journal of Software, 2014, 9(3): 641-647. 被引量:1
  • 5Miao G, Tatemura J, Hsiung W P, et al. Extracting data records from the Web using tag path clustering [C]// Proceedings of the 18th International Conference on World Wide Web. New York: ACM Press, 2009: 981-990. 被引量:1
  • 6Zhai Y, Liu B. Web data extraction based on partial tree d.\\%nmcni[Cy/Proceedings of the 14 th International Conference on World Wide Web. New York: ACM Press,2005: 76-85. 被引量:1
  • 7Kayed M, Chang C H. FiVaTech: Page-level Web data extraction from template pages [J]. Knowledge and Data Engineering, IEEE Transactions on, 2010, 22(2): 249-263. 被引量:1
  • 8Grigalis T, Cenys A. Unsupervised structured data extraction from template-generated Web pages [J]. J Universal Computer Science, 2014, 20(2): 169-192. 被引量:1
  • 9Blanco L, Dalvi N, Machanavajjhala A. Highly efficient algorithms for structural clustering of large websites [C]// Proceedings of the 20th International Conference on World Wide Web. New York: ACM Press , 2011: 437-446. 被引量:1
  • 10Zhai Y, Liu B. Extracting Web data using instance-based learning [J], World Wide Web, 2007,10(2): 113-132. 被引量:1

同被引文献2

引证文献1

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部