期刊文献+

大规模RDF三元组转换及存储工具比较研究 被引量:12

A Comparative Study of Large-scale RDF Triple Conversion and Storage Tools
下载PDF
导出
摘要 富含语义知识的数据网络是实现大数据智能的基石。资源描述框架(Resource Description Framework,RDF)是用于描述网络资源的W3C标准。大规模转换、存储管理RDF三元组是构建关联数据网络或语义知识图谱,实现数据可查找、可访问、可交互、可再用的重要路径。本文选择国际主流的10种RDF三元组转换工具,以及6种广受欢迎的RDF存储系统,从技术原理、性能特点及应用场景等多个视角进行对比分析,并总结存在问题和不足。提出未来大规模RDF三元组数据转换与存储管理需要实现的目标是实现RDF抽取、转换和加载(ETL)的流程化和集成化,并重点支撑4类典型应用需求场景,包括从非RDF数据到RDF数据的转换,不同RDF数据格式之间的双向转换,RDF三元组在数据库之间的数据迁移,以及RDF数据的动态更新和进化管理。 Data network rich in semantic knowledge is the cornerstone of realizing big data intelligence. Resource Description Framework(RDF) is the W3 C standard for describing web resources. Large-scale conversion and storage management of RDF triples is an important path for building a linked data network or semantic knowledge graph and realizing data Findable, Accessible, Interoperable and Reusable(FAIR principle). In this paper, ten international mainstream RDF conversion tools and six popular RDF triple storage systems are selected, and a comparative analysis is made from the perspectives of technical principles, performance characteristics and application scenarios, and briefly summarize the existing problems and shortcomings. It is proposed that the goal of large-scale RDF triple data conversion and storage management is to realize the flow, integration and integration of RDF Extract-TransformLoad(ETL), and to focus on supporting four typical application requirements scenarios, including: conversion from non-RDF data to RDF data;bidirectional conversion between different RDF data formats;data migration of RDF triples between databases;dynamic update and evolutionary management of RDF data.
作者 李悦 孙坦 赵瑞雪 李娇 黄永文 罗婷婷 鲜国建 LI Yue;SUN Tan;ZHAO RuiXue;LI Jiao;HUANG YongWen;LUO TingTing;XIAN GuoJian(Agricultural Information Institute of CAAS,Beijing 100081;Key Laboratory of Agricultural Big Data,Ministry of Agriculture and Rural Affairs,Beijing 100081)
出处 《数字图书馆论坛》 CSSCI 2020年第11期2-12,共11页 Digital Library Forum
基金 国家社会科学基金项目“科技论文全景式摘要知识图谱构建与应用研究”(编号:19BTQ061)资助。
关键词 RDB2RDF RDF转换 RDF存储 大数据智能 知识图谱 RDB2RDF RDF Conversion RDF Triple Store Big Data Intelligence Knowledge Graph
  • 相关文献

参考文献8

二级参考文献49

  • 1杨政,康磊.ETL技术在银行成本分摊系统数据处理中的应用[J].智能计算机与应用,2020,0(1):211-213. 被引量:3
  • 2鲍捷,宋靖雁.分布式网络计算机域的一种系统模型及其文件系统[J].计算机应用与软件,2006,23(5):86-89. 被引量:3
  • 3田敬,代亚非.P2P持久存储研究[J].软件学报,2007,18(6):1379-1399. 被引量:52
  • 4Lin M, Marzullo K. Directional Gossip: Gossip in a Wide - Area Network[ R]. San Diego: Dept of Computer Science and Eng, University of California, 1999. 被引量:1
  • 5Lamport L,Shostak R,Pease M. The Byzantine generals problem[J ]. ACM TO PLAS, 1982,4(3) :382 - 401. 被引量:1
  • 6Weatherspoon H, Kubiatowicz J. Erasure coding vs. replication: A quantitative comparison[ C]//In: Proc. of the 1 st Int' l Workshop on Peer - to - Peer Systems. Berlin: Springer, 2002: 328 - 337. 被引量:1
  • 7Heath T, Bizer C. Linked Data: Evolving the Web into a Global Data Space[ M. Morgan & Claypool Publishers, 2011. 被引量:1
  • 8Hausenblas M. NoSQL Solutions for Linked Data Processing [ EB/ OL]. (2011 -05 -03). [2012 - 12 -08]. https ://docs. google, corn/ document/pub? id = lxeb43XJz43qVzoq22PyplASNYMRYXKiq - At2QLvp8_c&embedded = tree. 被引量:1
  • 9Bendiken A. How RDF Databases Differ from Other NoSQL Solu- tions[ EB/OL]. (2010 - 04 - 22 ). [ 2012 - 12 - 08 ]. http:// blog. datagraph, org/2010/04/rdf- nosql - diff. 被引量:1
  • 10Sahoo S S, Halb W, Hellmann S, et al. A Survey of Current Ap- proaches for Mapping of Relational Databases to RDF [ EB/OL ]. (2012- 10 - 16). [2012 - 12 - 10]. http://www, w3. org/ 2005/Incubator/rdb2rdf/RDB2RDF_SurveyReport. pdf. 被引量:1

共引文献262

同被引文献119

引证文献12

二级引证文献46

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部