期刊文献+

VDoc+:a virtual document based approach for matching large ontologies using MapReduce 被引量:4

VDoc+:a virtual document based approach for matching large ontologies using MapReduce
原文传递
导出
摘要 Many ontologies have been published on the Semantic Web,to be shared to describe resources.Among them,large ontologies of real-world areas have the scalability problem in presenting semantic technologies such as ontology matching(OM).This either suffers from too long run time or has strong hypotheses on the running environment.To deal with this issue,we propose a three-stage MapReduce-based approach V-Doc+ for matching large ontologies,based on the MapReduce framework and virtual document technique.Specifically,two MapReduce processes are performed in the first stage to extract the textual descriptions of named entities(classes,properties,and instances) and blank nodes,respectively.In the second stage,the extracted descriptions are exchanged with neighbors in Resource Description Framework(RDF) graphs to construct virtual documents.This extraction process also benefits from the MapReduce-based implementation.A word-weight-based partitioning method is proposed in the third stage to conduct parallel similarity calculation using the term frequency-inverse document frequency(TF-IDF) model.Experimental results on two large-scale real datasets and the benchmark testbed from Ontology Alignment Evaluation Initiative(OAEI) are reported,showing that the proposed approach significantly reduces the run time with minor loss in precision and recall. Many ontologies have been published on the Semantic Web, to be shared to describe resources. Among them, large ontologies of real-world areas have the scalability problem in presenting semantic technologies such as ontology matching (OM). This either suffers from too long run time or has strong hypotheses on the running environment. To deal with this issue, we propose a three-stage MapReduce-based approach V-Doc+ for matching large ontologies, based on the MapReduce framework and virtual document technique. Specifically, two MapReduce processes are performed in the first stage to extract the textual descriptions of named entities (classes, properties, and instances) and blank nodes, respectively. In the second stage, the extracted descriptions are exchanged with neighbors in Resource Description Framework (RDF) graphs to construct virtual documents. This extraction process also benefits from the MapReduce-based implementation. A word-weight-based partitioning method is proposed in the third stage to conduct parallel similarity calculation using the term frequency-inverse document frequency (TF-IDF) model. Experimental results on two large-scale real datasets and the benchmark testbed from Ontology Alignment Evaluation Initiative (OAEI) are reported, showing that the proposed approach significantly reduces the run time with minor loss in precision and recall.
出处 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2012年第4期257-267,共11页 浙江大学学报C辑(计算机与电子(英文版)
基金 supported by the National Natural Science Foundation of China (No.61003018) the Natural Science Foundation of Jiangsu Province,China (No.BK2011189) the National Social Science Foundation of China (No.11AZD121)
  • 相关文献

参考文献23

  • 1Bethca,W.L,Fink,C.R,Beecher-Dcighan,J.S. JHU/APL Onto-Mapology Results for OAEI 2006[A].2006.144-152. 被引量:1
  • 2Castano,S,Ferrara,A,Mcssa,G. Results of the HMatch Ontology Matchmaker in OAEI 2006[A].2006.134-143. 被引量:1
  • 3Dean,J,Ghemawat,S. MapRcduce:simplified data processing on large clusters[J].Communications of the ACM,2008,(01):107-113. 被引量:1
  • 4Do,H.H,Rahm,E. Matching large schemas:approaches and evaluation[J].Information Systems,2007,(06):857-885. 被引量:1
  • 5Euzenat,J,Shvaiko,P. Ontology Matching[M].Springer,Heidelberg,Germany,2007. 被引量:1
  • 6Euzenat,J,Ferrara,A,Meilicke.C,Nikolov,A,Pane,J,Scharffe,F,Shvaiko,P,Stuckenschmidt.H,(S)vábZamazal,O,Svátek,V. Results of the Ontology Alignment Evaluation Initiative 2010[A].2010.85-117. 被引量:1
  • 7Gross,A,Hartung,M,Kirsten,T,Rahm,E. On matching large life science ontologies in parallel[J].Lecture Notes in Computer Science,2010.35-49. 被引量:1
  • 8Hu,W,Qu.Y.Z,Cheng,G. Matching large ontologies:a divide-and-conquer approach[J].Data and Knowledge Engineering,2008,(01):140-160. 被引量:1
  • 9Kotis,K,Valarakos,A.G,Vouros,G.A. AUTOMS:Automated Ontology Mapping Through Synthesis of Methods[A].2006.96-106. 被引量:1
  • 10Li,J.Z,Tang,J,Li,Y,Luo.Q. RiMOM:a dynamic multistrategy ontology alignment framework[J].IEEE Transactions on Knowledge and Data Engineering,2009,(08):1218-1232. 被引量:1

同被引文献28

引证文献4

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部