期刊文献+

Threshold-Based Shortest Path Query over Large Correlated Uncertain Graphs

Threshold-Based Shortest Path Query over Large Correlated Uncertain Graphs
原文传递
导出
摘要 With the popularity of uncertain data, queries over uncertain graphs have become a hot topic in the database community. As one of the important queries, the shortest path query over an uncertain graph has attracted much attention of researchers due to its wide applications. Although there are some e?cient solutions addressing this problem, all existing models ignore an important property existing in uncertain graphs: the correlation among the edges sharing the same vertex. In this paper, we apply Markov network to model the hidden correlation in uncertain graphs and compute the shortest path. Unfortunately, calculating the shortest path and corresponding probability over uncertain graphs modeled by Markov networks is a #P-hard problem. Thus, we propose a filtering-and-verification framework to accelerate the queries. In the filtering phase, we design a probabilistic shortest path index based on vertex cuts and blocks of a graph. We find a series of upper bounds and prune the vertices and edges whose upper bounds of the shortest path probability are lower than the threshold. By carefully picking up the blocks and vertex cuts, the index is optimized to have the maximum pruning capability, so that we can filter a large number of vertices which make no contribution to the final shortest path query results. In the verification phase, we develop an e?cient sampling algorithm to determine the final query answers. Finally, we verify the e?ciency and effectiveness of our solutions with extensive experiments. With the popularity of uncertain data, queries over uncertain graphs have become a hot topic in the database community. As one of the important queries, the shortest path query over an uncertain graph has attracted much attention of researchers due to its wide applications. Although there are some e?cient solutions addressing this problem, all existing models ignore an important property existing in uncertain graphs: the correlation among the edges sharing the same vertex. In this paper, we apply Markov network to model the hidden correlation in uncertain graphs and compute the shortest path. Unfortunately, calculating the shortest path and corresponding probability over uncertain graphs modeled by Markov networks is a #P-hard problem. Thus, we propose a filtering-and-verification framework to accelerate the queries. In the filtering phase, we design a probabilistic shortest path index based on vertex cuts and blocks of a graph. We find a series of upper bounds and prune the vertices and edges whose upper bounds of the shortest path probability are lower than the threshold. By carefully picking up the blocks and vertex cuts, the index is optimized to have the maximum pruning capability, so that we can filter a large number of vertices which make no contribution to the final shortest path query results. In the verification phase, we develop an e?cient sampling algorithm to determine the final query answers. Finally, we verify the e?ciency and effectiveness of our solutions with extensive experiments.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第4期762-780,共19页 计算机科学技术学报(英文版)
基金 This work is supported in part by the National Natural Science Foundation of China under Grant Nos. 61332006, U1401256, 61328202, 61173029, the Fundamental Research Funds for the Central Universities of China under Grant No. N130504006, the Hong Kong RGC Project under Grant No. N_HKUST637/13, the National Basic Research 973 Program of China under Grant No. 2014CB340300, Microsoft Research Asia Gift Grant and Google Faculty Award 2013.
关键词 shortest path correlated uncertain graph probabilistic shortest path index shortest path, correlated uncertain graph, probabilistic shortest path index
  • 相关文献

参考文献50

  • 1Yuan Y, Chen L, Wang G. Efficiently answering proba- bility threshold-based shortest path queries over uncertain graphs. In Proc. the 15th DASFAA, Apr. 2010, pp.155-170. 被引量:1
  • 2Asthana S, King O D, Gibbons F D, Roth F P. Predicting protein complex membership using probabilistic network re- liability. Genome Research, 2004, 14(6): 1170-1175. 被引量:1
  • 3Nierman A, Jagadish H. ProTDB: Probabilistic data in XML. In Proc. the 28th VLDB, Aug. 2002, pp.646-657. 被引量:1
  • 4Adar E, R@ C. Managing uncertainty in social networks. IEEE Data Eng. Bull, 2007, 30(2): 15-22. 被引量:1
  • 5Lian X, Chen L. Efficient query answering in probabilistic RDF graphs. In Proc. A CM S1GMOD International Con- ference on Management of Data, May 2011, pp.157-168. 被引量:1
  • 6Zou L, Peng P, Zhao D. Top possible shortest path query over a large uncertain graph. In Proe. the 12th Int. Conf. Web Information System Engineering, Oct. 2011, pp.72-86. 被引量:1
  • 7Tao Y, Sheng C, Pei J. On k-skip shortest paths. In Proc. A CM SIGMOD International Conference on Management of Data, Jun. 2011, pp.421-432. 被引量:1
  • 8Yao B, Tang M, Li F. Multi-approximate-keyword routing in GIS data. In Proc. the 19th ACM SIGSPATIAL Interna- tional Conference on Advances in Geographic Information Systems, Nov. 2011, pp.201-210. 被引量:1
  • 9Fan W, Wang X, Wu Y. Performance guarantees for dis- tributed reachability queries. Proceedings of the VLDB En- dowment, 2012, 5(11): 1304-1315. 被引量:1
  • 10Gao J, Jin R, Zhou J, Yu J X, Jiang X, Wang T. Relational approach for shortest path discovery over large graphs. Pro- ceedings of the VLDB Endowment, 2011, 5(4): 358-369. 被引量:1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部