期刊文献+

基于SVM的人物实体全局模式动态构建

Person entity global schema constructed dynamically with SVM
下载PDF
导出
摘要 针对结构化网页中人物实体全局模式构建问题,提出了基于SVM的动态构建方法。构建分为两个阶段,第一个阶段是来自同一数据源的人物实体结构化实例到人物实体局部模式的转化,第二个阶段是利用SVM分类器完成人物实体局部模式到人物实体全局模式的映射。本方法能适应数据源的不断变化,保证了全局模式的完整性。通过实验,验证了构建算法的有效性和可行性,并对随着结构化网页不断增多时全局模式的稳定性进行了考察。 To constructed person entity global schema in structured web pages,a method of dynamic construction with SVM is proposed, it has two stages. The first stage is to transform person entity in- stances from the same data source to person entity local schema. The second stage is to map person entity local schema to person entity global schema with the SVM classifier. Our method can adapt to the change of data sources and ensure the completeness of the global model. Through experiments, the effectiveness and feasibility of the construction algorithm are verified and the stability of the global schema with increasing of structured web pages is studied.
作者 曹鲁慧
出处 《计算机工程与科学》 CSCD 北大核心 2014年第10期1888-1893,共6页 Computer Engineering & Science
关键词 SVM 人物实体局部模式 人物实体全局模式 结构化网页 SVM person entity local schema person entity global schema structured web pages
  • 相关文献

参考文献11

  • 1Li Yu-kun,Meng Xiao-feng.Research on personal dataspace management[C]∥Proc of the 2nd SIGMOD PhD Workshop on Innovative Database Research,2008:7-12. 被引量:1
  • 2Jiang Shuo,Le Jia-jin,Li Ye-feng.Research on operationbased correlation in personal dataspace[J].Journal of Electrical Engineering,2014,12(5):3297-3302. 被引量:1
  • 3Liu X,Zhang S,Wei F,et al.Recognizing named entities in tweets[C]∥Proc of the 49th Annual Meeting of the Association for Computational Linguistics:Human Language Technologies-Volume 1,2011:359-367. 被引量:1
  • 4Hong Xu-dong,Shen Tao,Shen Long-hua,et al.Unstructured data extraction of Chinese expert web page[J].International Journal of Wireless and Mobile Computing,2014,7(2):132-136. 被引量:1
  • 5Jiang Fang-jiao,Meng Yue-hong,Wei Ming-sheng,et al.Entity Identification in Deep Web[C]∥Proc of Web Information Systems Engineering-WISE 2013 Workshops,2014:144-152. 被引量:1
  • 6Rizzo G,Troncy R.NERD:A framework for unifying named entity recognition and disambiguation extraction tools[C]∥Proc of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics,2012:73-76. 被引量:1
  • 7姚从磊..Web实体提取与实体踪迹发现研究[D].北京大学,2008:
  • 8Yao Chong-lei,Yu Yong-jian,Shou Si-cong,et al.Towards aglobal schema for web entities[C]∥Proc of the 17th International Conference on World Wide Web,2008:999-1008. 被引量:1
  • 9Ding Yan-hui,Li Qing-zhong.Building the schema of web entity dynamically[J].Journal of Computational Information Systems,2011,7(9):3194-3201. 被引量:1
  • 10徐秀星..Web数据集成中全局模式构建方法研究[D].山东大学,2011:

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部