
Deep Web Query Interface Schema Matching Based on Matching Degree and Semantic Similarity
Abstract: Query interface schema matching is a key step in Deep Web information integration. The Dual Correlation Mining (DCM) method uses association mining to solve complex interface schema matching problems, but it falls short in both matching efficiency and matching accuracy. To address these shortcomings, this paper proposes a new schema matching method based on matching degree and semantic similarity. The method first stores the association relationships among attributes in a correlation matrix, then uses matching degree to compute the degree of correlation between attributes, and finally uses semantic similarity to score the candidate matches and ensure the accuracy of the final results. Experiments on the BAMM dataset from the University of Illinois show that the proposed method achieves higher matching efficiency and accuracy than DCM and its improved variants, indicating that it handles schema matching between query interfaces well.
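Authors: Feng Yong, Zhang Yang
Source: Journal of Computer Applications (《计算机应用》), indexed in CSCD and the Peking University Core Journals list, 2012, Issue 6, pp. 1688-1691 (4 pages)
Funding: National Natural Science Foundation of China (61103114); Chongqing Higher Education Teaching Reform Research Key Project (112023); "211 Project" Phase III Construction Project (S-10218); Fundamental Research Funds for the Central Universities (CDJXS11181164)
Keywords: Deep Web; schema matching; matching degree; semantic similarity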
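
The three steps named in the abstract (a correlation matrix, a matching degree, and a semantic similarity check) can be illustrated with a minimal Python sketch. This record does not give the paper's formulas, so everything below is an assumption made for illustration: `matching_degree` is a hypothetical score that is high when two attributes rarely appear on the same interface, `token_similarity` is a simple word-overlap (Jaccard) stand-in for a real semantic similarity measure, and the thresholds and sample schemas are invented.

```python
from itertools import combinations


def cooccurrence_matrix(schemas):
    """Build a symmetric matrix of attribute co-occurrence counts.

    Diagonal cells hold how many interfaces contain the attribute;
    off-diagonal cells hold how many interfaces contain both attributes.
    """
    attrs = sorted({a for schema in schemas for a in schema})
    index = {a: i for i, a in enumerate(attrs)}
    matrix = [[0] * len(attrs) for _ in attrs]
    for schema in schemas:
        present = sorted(set(schema))
        for a in present:
            matrix[index[a]][index[a]] += 1
        for a, b in combinations(present, 2):
            matrix[index[a]][index[b]] += 1
            matrix[index[b]][index[a]] += 1
    return attrs, matrix


def matching_degree(matrix, i, j):
    """Hypothetical score (not the paper's formula): 1.0 when attributes
    i and j never co-occur, 0.0 when the rarer one always co-occurs."""
    fi, fj, fij = matrix[i][i], matrix[j][j], matrix[i][j]
    if fi == 0 or fj == 0:
        return 0.0
    return 1.0 - fij / min(fi, fj)


def token_similarity(a, b):
    """Word-overlap (Jaccard) stand-in for a semantic similarity measure."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def candidate_matches(schemas, degree_threshold=0.9, sim_threshold=0.3):
    """Keep attribute pairs that pass both the degree and similarity checks."""
    attrs, matrix = cooccurrence_matrix(schemas)
    matches = []
    for i, j in combinations(range(len(attrs)), 2):
        degree = matching_degree(matrix, i, j)
        similarity = token_similarity(attrs[i], attrs[j])
        if degree >= degree_threshold and similarity >= sim_threshold:
            matches.append((attrs[i], attrs[j], degree, similarity))
    return matches


# Invented sample data: each inner list is one query interface's attributes.
schemas = [
    ["author", "title"],
    ["author name", "title"],
    ["author", "book title"],
    ["author name", "title", "isbn"],
]
matches = candidate_matches(schemas)
for a, b, degree, similarity in matches:
    print(f"{a} <-> {b}  degree={degree:.2f}  similarity={similarity:.2f}")
```

In this toy data, pairs such as "author" and "isbn" also never co-occur, so the co-occurrence score alone would accept them; the similarity check is what filters them out, which is the role the abstract assigns to the final step.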


