期刊文献+

开放式Web信息抽取系统研究与实现 被引量:3

RESEARCH AND DEVELOPMENT ON OPEN WEB INFORMATION EXTRACTION
下载PDF
导出
摘要 在分析Web信息资源固有特点的基础上,结合国内外已有的研究成果,提出了一个开放式的Web信息抽取系统,该系统的抽取规则不是内置于系统的“硬编码”,而是由系统通过自动学习归纳并结合用户干预生成的开放式规则,从而扩大了Web信息抽取系统的使用范围. With the help of research achievements home and abroad, an open Web information extraction system is given here based on the structure of Web information. The extraction rule of this system is not “hard encoding”, but is deduced from its automatic learning with users' necessary adjusting. As a result, the system can be widely used.
作者 傅骞 温晓辉
出处 《北京师范大学学报(自然科学版)》 CAS CSCD 北大核心 2005年第6期594-598,共5页 Journal of Beijing Normal University(Natural Science)
基金 北京师范大学青年教师基金项目
关键词 WEB信息抽取 算法归纳 聚类 Web information extraction wrapper induction clustering
  • 相关文献

参考文献8

  • 1李保利,陈玉忠,俞士汶.信息抽取研究综述[J].计算机工程与应用,2003,39(10):1-5. 被引量:178
  • 2Robert Gaizauskas,Yorick Wilks.Information extraction:beyond document retrieval [J].Journal of Documentation,1998,54 (1):70 被引量:1
  • 3Han Jiawei,Karnber M.Data mining concxepts and techiques[M].范明,孟小峰,译.北京:北京工业出版社,2001 被引量:1
  • 4Wadie Sirgany.An introduction to the art and mathematics of cluster analysis[EB/OL].[2004-11-10].http://www.i-m-i.info/bytesofscience/archives/clus.htm 被引量:1
  • 5Dayne Freitag.Information extraction from HTML:application of a general machine learning approach[C]//Proceedings of the 15'th National Conference on Artificial Intelligence (AAAI-98),Madison:Wisconsin,1998 被引量:1
  • 6Mary Elaine Califf.Relational learning techniques for natural language information extraction[EB/OL].[2005-03-10].http://www.cs.utexas.edu/users/mi/papers/rapier-dissertation98.pdf 被引量:1
  • 7Ion Muslea,Steve Minton,Craig Knoblock.Hierarchical wrapper induction for semi-structured sources [J].Journal of Autonomous Agents and Multi-Agent Systems,2001,4:93 被引量:1
  • 8Liu Ling,Calton Pu,Han Wei.XWRAP:an XML-based wrapper construction system for web information sources[EB/OL].[2005-03-10].http://citeseer.ist.psu.edu/215418.html 被引量:1

二级参考文献20

  • 1[16]Hobbs J,Appelt D,Bear J et al.FASTUS:A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text[C].In:Roche,Schabes eds. Finite State Devices for Natural Language Processing, MIT Press,Cambridge MA, 1996 被引量:1
  • 2[17]Appelt D E.Introduction to Information Extraction[J].AI COMMUNICATIONS, 1999; 12(3) 被引量:1
  • 3[18]Yangarber R.Scenario Customization for Information Extraction[D].Ph D Thesis.New York University,2001-01 被引量:1
  • 4[19]Cowie J, Lehnert W.Information Extraction[J].Communications of the ACM, 1996;39(1) 被引量:1
  • 5[20]Grishman R Adaptive information extraction and sublangu age analysis[C].In:Proceedings of IJCAI-2001 Workshop on Adaptive Text Extraction and Mining,2001 被引量:1
  • 6[1]Applet D E,Israel D J.Introduction to Information Extraction Technology. A Tutorial for IJCAI-99,1999 被引量:1
  • 7[2]Gaizauskas R,Wilks Y.Information Extraction:Beyond Document Retrieval[J].Journal of Documentation, 1997 被引量:1
  • 8[3]Sager N.Natural Language Information Processing. Reading,Massachusetts:Addison Wesley, 1981 被引量:1
  • 9[4]Dejong G.An Overview of the FRUMP System[C].In:LEHNERT W,RINGLE M h eds. Strategies for Natural Language Processing,Lawrence Erlbaum, 1982:149~176 被引量:1
  • 10[5]Grishman R,Sundheim B.Message Understanding Conference-6:A Brief History[C].In :Proceedings of the 16h International Conference on Computational Linguistics(COLING-96),1996-08 被引量:1

共引文献177

同被引文献14

引证文献3

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部