期刊文献+

Attribute Level Lineage in Uncertain Data with Dependencies

Attribute Level Lineage in Uncertain Data with Dependencies
原文传递
导出
摘要 In uncertain data management, lineages are often used for probability computation of result tuples. However, most of existing works focus on tuple level lineage, which results in imprecise data derivation. Besides, correlations among attributes cannot be captured. In this paper, for base tuples with multiple uncertain attributes, we define attribute level annotation to annotate each attribute. Utilizing these annotations to generate lineages of result tuples can realize more precise derivation. Simultaneously,they can be used for dependency graph construction. Utilizing dependency graph, we can represent not only constraints on schemas but also correlations among attributes. Combining the dependency graph and attribute level lineage, we can correctly compute probabilities of result tuples and precisely derivate data. In experiments, comparing lineage on tuple level and attribute level, it shows that our method has advantages on derivation precision and storage cost. In uncertain data management, lineages are often used for probability computation of result tuples. However, most of existing works focus on tuple level lineage, which results in imprecise data derivation. Besides, correlations among attributes cannot be captured. In this paper, for base tuples with multiple uncertain attributes, we define attribute level annotation to annotate each attribute. Utilizing these annotations to generate lineages of result tuples can realize more precise derivation. Simultaneously,they can be used for dependency graph construction. Utilizing dependency graph, we can represent not only constraints on schemas but also correlations among attributes. Combining the dependency graph and attribute level lineage, we can correctly compute probabilities of result tuples and precisely derivate data. In experiments, comparing lineage on tuple level and attribute level, it shows that our method has advantages on derivation precision and storage cost.
出处 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2016年第5期376-386,共11页 武汉大学学报(自然科学英文版)
基金 Supported by the Key Program of National Natural Science Foundation of China(61232002) The National Natural Science Foundation of China(61202033) The Program for Innovative Research Team of Wuhan(2014070504020237) The Ph.D.Seed Foundation of Wuhan University(2012211020207) The Science and Technology Support Program of Hubei Province(2015BAA127)
关键词 uncertain data attribute level lineage DEPENDENCY uncertain data attribute level lineage dependency
  • 相关文献

参考文献20

  • 1Sarma A D, Theobald M, Widom J. Exploiting lineage for confidence computation in uncertain and probabilistic databases [C]//Proc 24th International Conference on Data Engineering. Washington D C : IEEE Computer Society Press, 2008: 1023-1032. 被引量:1
  • 2Dalvi N, Suciu D. Efficient query evaluation on probabilistic databases [J]. The VLDB Journal, 2007, 16(4): 523-544. 被引量:1
  • 3Benjelloun O, Sarma A D, Halevy A, et al. Databases with uncertainty and lineage [J]. The VLDB Journal, 2008, 17(2): 243-264. 被引量:1
  • 4Sen P, Deshpande A. Representing and querying correlated tuples in probabilistic databases [C]//Proc 23rd Internation- al Conference on Data Engineering. Washington D C: IEEE Computer Society Press, 2007: 596-605. 被引量:1
  • 5Huang J, Antova L, Koch C, et al. MayBMS: a probabilistic database management system [C]//Proc 36th ACM Interna- tional Conference on Management of Data. New York: ACM Press, 2009: 1071-1074. 被引量:1
  • 6Singh S, Mayfield C, Shah R, et al. Database support for probabilistic attributes and tuples [C]//Proc 24th Internati- onal Conference on Data Engineering. Washington D C: IEEE Computer Society Press, 2008: 1053-1061. 被引量:1
  • 7周傲英,金澈清,王国仁,李建中.不确定性数据管理技术研究综述[J].计算机学报,2009,32(1):1-16. 被引量:185
  • 8Fuhr N, R611eke T. A probabilistic relational algebra for the integration of information retrieval and database systems [J]. ACM Transactions on Information Systems, 1997, 15(1): 32-66. 被引量:1
  • 9Lakshmanan L V S, Leone N, Ross R, et al. Probview: A flexible probabilistic database system [J]. ACM Transactions on Database Systems, 1997, 22(3): 419-469. 被引量:1
  • 10Sarma A D, Benjelloun O, Halevy A, et al. Working models for uncertain data [C]//Proc 22nd International Conference on Data Engineering.Washington D C: IEEE ComputerSociety Press, 2006. 被引量:1

二级参考文献120

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2谷峪,于戈,张天成.RFID复杂事件处理技术[J].计算机科学与探索,2007,1(3):255-267. 被引量:54
  • 3Deshpande A, Guestrin C, Madden S, Hellerstein J M, Hong W. Model-driven data acquisition in sensor networks// Proceedings of the 30th International Conference on Very Large Data Bases. Toronto, 2004:588-599 被引量:1
  • 4Madhavan J, Cohen S, Xin D, Halevy A, Jeffery S, Ko D, Yu C. Web-scale data integration: You can afford to pay as you go//Proceedings of the 33rd Biennial Conference on Innovative Data Systems Research. Asilomar, 2007:342-350 被引量:1
  • 5Liu Ling. From data privacy to location privacy: Models and algorithms (tutorial)//Proceedings of the 33rd International Conference on Very Large Data bases. Vienna, 2007: 1429- 1430 被引量:1
  • 6Samarati P, Sweeney L. Generalizing data to provide anonymity when disclosing information (abstract)//Proeeedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. Seattle, 1998:188 被引量:1
  • 7Cavallo R, Pittarelli M. The theory of probabilistic databases//Proceedings of the 13th International Conference on Very Large Data Bases. Brighton, 1987:71-81 被引量:1
  • 8Barbara D, Garcia-Molina H, Porter D. The management of probabilistic data. IEEE Transactions on Knowledge and Data Engineering, 1992, 4(5): 487-502 被引量:1
  • 9Fuhr N, Rolleke T. A probabilistic relational algebra for the integration of information retrieval and database systems. ACM Transactions on Information Systems, 1997, 15(1): 32-66 被引量:1
  • 10Zimanyi E. Query evaluation in probabilistic databases. Theoretical Computer Science, 1997, 171(1-2): 179-219 被引量:1

共引文献188

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部