期刊文献+

基于阈值的概率图可达查询 被引量:3

Answering Threshold-Based Reachability Queries Over Probabilistic Graphs
下载PDF
导出
摘要 图的可达性查询被广泛应用于生物网络、社会网络、本体网络、RDF网络等.由于对数据操作时引入的噪声和错误使这些图数据具有不确定性,而确定图的可达查询不能有效地处理不确定性,因此该文研究用概率语义描述的图可达性查询.具体的,该文使用可能世界概率模型定义不确定图(称为概率图),基于该模型,研究了基于阈值的概率可达查询(T-PR).首先为避免枚举所有可能世界,给出一个基本算法可精确求解T-PR查询.其次为进一步加速基本算法,给出3种改进方法,它们是不确定事件界、同构图的缩减、基于不相交路径和割集的界.通过合理的组合给出3种方法的合并算法.最后基于真实概率图数据的大量实验验证了该文的设计. Graph reachability queries are widely used in biological networks,social networks,ontology networks and RDF networks.Meanwhile,data extracted from those applications is inherently uncertain due to noise,incompleteness and inaccuracy,and traditional certain reachability queries cannot effectively express semantics of such uncertain graph data.Therefore,in this paper,the authors study the reachability queries over uncertain graphs under the probabilistic semantics.Specifically,they study a threshold-based probabilistic reachability(T-PR)query over an uncertain graph using the possible world semantics(called probabilistic graph).Firstly,to avoid enumerating all possible worlds,the authors propose a basic algorithm that can exactly compute T-PR query.To further speed up the basic algorithm,they develop three improved approaches,that is,u-event bounds,isomorphic graph reduction,and disjoint path/cut set bounds.Moreover,the authors combine the three improved algorithms into one entire algorithm.Finally,they have verified the effectiveness of the proposed solutions for T-PR queries through extensive experiments on real probabilistic graph datasets.
作者 袁野 王国仁
出处 《计算机学报》 EI CSCD 北大核心 2010年第12期2219-2228,共10页 Chinese Journal of Computers
基金 国家自然科学基金重点项目(60933001) 国家自然科学基金面上项目(60773221) 国家"八六三"高技术研究发展计划项目基金(2009AA01Z150)资助~~
关键词 概率图 可能世界 不确定事件 同构图缩减 路径集 割集 probabilistic graph possible world uncertain event isomorphic graph reduction path set cut set
  • 相关文献

参考文献24

  • 1Nierman A,Jagadish H V.ProTDB:Probabilistic data in XML//Proceedings of the VLDB.Hong Kong,China,2002:646-657. 被引量:1
  • 2Senellart P,Abiteboul S.On the complexity of managing probabilistic XML data//Proceedings of the PODS.Beijing,China,2007:283-289. 被引量:1
  • 3Haase P,Volker J.Ontology learning and reasoning-dealing with uncertainty and inconsistency//Proceeding of the Workshop on Uncertainty Reasoning for the Semantic Web (URSW).New York,USA,2005:45-55. 被引量:1
  • 4Asthana S,King O D,Gibbons F D et al.Predicting protein complex membership using probabilistic network reliability.Genome Research,2004,14(6):1170-1175. 被引量:1
  • 5Chui H N,Sung W K,Wong L.Exploiting indirect neighbors and topological weight to predict protein function from protein-protein interactions.Bioinformatics,2007,22(13):47-58. 被引量:1
  • 6Jiang R,Tu Z,Chen T et al.Network motif identification in stochastic networks.PNAS,2006,103(25):9404-9409. 被引量:1
  • 7Saito R,Suzuki H,Hayashizaki Y.Interaction generality:A measurement to assess the reliability of a protein-protein interaction.Nucleic Acids Research,2002,30(5):1163-1168. 被引量:1
  • 8Dalvi N N,Suciu D.Management of probabilistic data:Foundations and challenges//Proceedings of the PODS.Beijing,China,2007:1-12. 被引量:1
  • 9Khoussainova N,Balazinska M.Towards correcting input data errors probabilistically using integrity constraints//Proceedings of the MobiDE Workshop.Chicago,Illinois,USA,2006:43-50. 被引量:1
  • 10Jin R,Xiang Y,Ruan N.Efficiently answering reachability queries on very large directed graphs//Proceedings of the SIGMOD.Vancouver,Canada,2008:595-608. 被引量:1

二级参考文献41

  • 1周水庚 蔚赵春 蒋豪良.图结构数据搜索的概念、问题与进展[J].中国计算机学会通讯,2007,3(8):59-65. 被引量:2
  • 2Shasha D, Wang T L J, Giugno R. Algorithmics and applications of tree and graph searching//Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. Madison, 2002:39-52. 被引量:1
  • 3Yan X, Yu P S, Han J. Graph indexing: A frequent structure-based approach//Proeeedings of the 2004 ACM SIGMOD International Conference on Management of Data. Paris, 2004:335-346. 被引量:1
  • 4Saito R, Suzuki H, Hayashizaki Y. Interaction generality, a measurement to assess the reliability of a protein-protein interaction. Nucleic Acids Research, 2002, 30(5): 1163-1168. 被引量:1
  • 5李建中 于戈 周傲英.不确定性数据管理的要求与挑战[J].中国计算机学会通讯,2009,5(4):6-14. 被引量:8
  • 6高宏 张炜.不确定图数据管理研究现状[J].中国计算机学会通讯,2009,5(4):31-36. 被引量:2
  • 7Dalvi N N, Suciu D. Efficient query evaluation on probabilistic databases. VLDB Journal, 2007, 16(4): 523-544. 被引量:1
  • 8Soliman M A, Ilyas I F, Chang K C. Top-k query processing in uncertain databases//Proceedings of the 23rd International Conference on Data Engineering. Istanbul, 2007:896-905. 被引量:1
  • 9Hintsanen P. The most reliable subgraph problem//Proceedings of the llth European Conference on Principles and Practice of Knowledge Discovery in Databases. Warsaw, 2007: 471-478. 被引量:1
  • 10Hintsanen P, Toivonen H. Finding reliable subgraphs from large probabilistic graphs. Data Mining and Knowledge Discovery, 2008, 17(1): 3-23. 被引量:1

共引文献50

同被引文献16

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部