期刊文献+

基于本体的DeepWeb数据源发现方法 被引量:1

Deep Web Data Sources Discovery Method Based on Ontology
下载PDF
导出
摘要 提出一种基于本体的Deep Web数据源发现方法,采用网页分类、表单内容分类、表单结构分类方式,确定符合某领域的DeepWeb查询接口。在网页分类和表单内容分类中引入本体的半自动构建和自动扩展模块,在表单结构分类中添加启发式规则。实验结果证明,该方法能有效提高Deep Web数据源的查全率和查准率。 This paper presents a Deep Web data sources discovery method based on ontology. It uses webpage classification, form structure classification and form content classification to find Deep Web querying interface in some fields. It proposes that semi-automatic construction and automatic extension of ontology are added to the webpage and form content classification, and heuristic rules are enriched in the form structure classification. Experimental results show that this method can improve the precision and recall of Deep Web database discovery effectively.
作者 李道申 刘勇
出处 《计算机工程》 CAS CSCD 2012年第4期52-54,共3页 Computer Engineering
基金 国家自然科学基金资助项目(70671035)
关键词 深网 本体 数据源 半自动构建 分类模型 Deep Web ontology data sources semi-automatic construction classification model
  • 相关文献

参考文献7

二级参考文献106

  • 1Chang K C C, He Bin, Li Chengkai, et al. Struetured Databases on the Web: Observations and Implieations[J]. ACM SIGMOD Record, 2004, 33(3): 61-70. 被引量:1
  • 2Bergmanm M K. The Deep Web: Surfacing the Hidden Value[J]. Journal of Electronic Publishing in Taking License: Recognizing a Need to Change, 2001, 7(1): 30-32. 被引量:1
  • 3.[EB/OL].http://www.cogsci.Princeton.edu,. 被引量:2
  • 4Fetterly D,Manasse M,Najork M,Wiener J L.A largescale study of the evolution of Web pages//Proceedings of the 12th International World Wide Web Conference.Budapest,2003:669-678 被引量:1
  • 5Chang K C,He B,Li C,Patel M,Zhang Z.Structured databases on the Web:Observations and Implications.SIGMOD Record,2004,33(3):61-70 被引量:1
  • 6Cope J,Craswell N,Hawking D.Automated discovery of search interfaces on the Web//Proceedings of the 14th Australasian Database Conference(ADC 2003).Adelaide,2003:181-189 被引量:1
  • 7Zhang Z,He B,Chang K C.Understanding Web query interfaces:Best-effort parsing with hidden syntax//Proceedings of the 23rd ACM SIGMOD International Conference on Management of Data.Paris,2004:107-118 被引量:1
  • 8Arasu A,Garcia-Molina H.Extracting structured data from Web pages//Proceedings of the 22nd ACM SIGMOD International Conference on Management of Data.San Diego,2003:337-348 被引量:1
  • 9Crescenzi V,Mecca G,Merialdo P.RoadRunner:Towards automatic data extraction from large Web sites//Proceedings of the 27th International Conference on Very Large Data Bases.Italy,2001:109-118 被引量:1
  • 10Wittenburg K,Weitzman L.Visual grammars and incremental parsing for interface languages//Proceedings of the IEEE Symposium on Visual Languages (VL).Skokie,1990:111-118 被引量:1

共引文献160

同被引文献15

  • 1Balakrishnan R,Kambhampati S. Source Rank: Relevance and Trust Assessment for Deep Web Sources Based on Inter-source Agreement [ C ]//Proceedings of the 20th International Conference on World Wide Web. New York, USA :ACM Press,2011:227-236. 被引量:1
  • 2Dong X L, Saha B, Srivastava D. Less Is More: Selecting Sources Wisely for Integration [ C ]//Proceedings of the 39th International Conference on Very Large Data Bases. [ S. 1. ] :Morgan Kaufmann Publishers,2013 : 37-48. 被引量:1
  • 3Rekatsinas T, Dong X L. Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration [ C ]//Proceedings of the 7th Biennial Con- ference on Innovative Data Systems Research. New York, USA:ACM Press ,2015 : 1-7. 被引量:1
  • 4Rekatsinas T, Dong X L. Characterizing and Selecting Fresh Data Sources [ C ]//Proceedings of 2014 ACMSIGMOD International Conference on Management of Data. New York, USA : ACM Press ,2014:919-930. 被引量:1
  • 5Wang Ying, Zuo Wanli, He Fengling, et al. Ontology- assisted Deep Web Source Selection [J]- Computer Science for Environmental Engineering and Ecolnformatics, 2011,159(2) :66-71. 被引量:1
  • 6Nguyen K, Cao J. K-Graphs: Selecting Top-k Data Sources for XML Keyword Queries [ C ]//Proceedings of the 22nd International Conference on Database and Expert Systems Applications. Berlin, Germany : Springer- Verlag ,2011:425-439. 被引量:1
  • 7Markov I, Azzopardi L, Crestani F. Reducing the Uncertainty in Resource Selection [ C ]//Proceedings of the 35th European Conference on IR Research. Berlin, Germany : Springer-Verlag, 2013 : 507 -519. 被引量:1
  • 8Hong D, Si Luo. Search Result Diversification in Resource Selection for Federated Search[ C ]//Pro- ceedings of the 36th International ACM SIGIR Con- ference on Research and Development in Information Retrieval. New York, USA : ACM Press, 2013:613-622. 被引量:1
  • 9范举,周立柱.基于关键词的深度万维网数据库选择[J].计算机学报,2011,34(10):1797-1804. 被引量:11
  • 10朱冠胜,黄浩,杨卫东.XML关键字检索系统的数据源选择[J].小型微型计算机系统,2012,33(6):1183-1188. 被引量:4

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部