DISTRIBUTED DATA REPAIRING FOR LEXICOGRAPHICAL ORDER DEPENDENCE
Abstract Lexicographical order dependencies define order specifications on lists of attributes. In practice, data are large and contain errors. This paper investigates the problem of distributed data repairing for lexicographical order dependencies, aiming to modify the data so that the given order dependencies are satisfied. We design and implement distributed repair algorithms on the Spark platform, and conduct extensive experiments to verify the effectiveness and efficiency of the approach.
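To make the notion of a lexicographical order dependency concrete, the following is a minimal single-machine sketch of violation detection only. It is not the paper's distributed repair algorithm, and it does not use Spark. It assumes the usual reading that a dependency from attribute list X to attribute list Y holds when every pair of tuples ordered lexicographically by X is also ordered lexicographically by Y. The function names `project` and `od_violations` and the sample rows are hypothetical.

```python
from itertools import combinations


def project(row, attrs):
    """Project a row (dict) onto an attribute list as a tuple.

    Python compares tuples lexicographically, which matches the
    lexicographical order on attribute lists assumed here.
    """
    return tuple(row[a] for a in attrs)


def od_violations(rows, lhs, rhs):
    """Return index pairs (i, j), i < j, that violate lhs -> rhs.

    A pair violates the dependency if one tuple precedes (or equals)
    the other on lhs but does not precede (or equal) it on rhs.
    This is a quadratic pairwise check, for illustration only.
    """
    violations = []
    for i, j in combinations(range(len(rows)), 2):
        xi, xj = project(rows[i], lhs), project(rows[j], lhs)
        yi, yj = project(rows[i], rhs), project(rows[j], rhs)
        if (xi <= xj and not yi <= yj) or (xj <= xi and not yj <= yi):
            violations.append((i, j))
    return violations


# Hypothetical sample data: tax should not decrease as income grows.
rows = [
    {"income": 30, "bracket": 1, "tax": 3},
    {"income": 50, "bracket": 2, "tax": 6},
    {"income": 70, "bracket": 2, "tax": 5},
]

print(od_violations(rows, ["income"], ["bracket", "tax"]))
print(od_violations(rows, ["income"], ["bracket"]))
```

In this sample, the second and third rows are ordered by `income` but reversed on `(bracket, tax)`, so that pair is reported for the first dependency. A repair would then change cell values so that no such pair remains.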
Authors Guo Naiwang (郭乃网); Qin Sheng (覃晟); Tan Zijing (谈子敬); Cao Manliang (曹满亮) (State Grid Shanghai Municipal Electric Power Company, Shanghai 200437, China; Fudan University, Shanghai 200433, China)
Source Computer Applications and Software (《计算机应用与软件》), Peking University Core Journal, 2023, No. 9, pp. 37-42, 108 (7 pages in total)
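Funding Key R&D Program of the Ministry of Science and Technology (2018YFB1402600); Shanghai Science and Technology Commission project (19DZ2252800); State Grid Shanghai science and technology project (52094020001A).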
Keywords Data repairing; Lexicographical order dependency; Distributed computing