
Study and Application of an Adaptive Failure Detection Algorithm
Abstract: Failure detection is one of the key techniques for ensuring the high availability of disaster recovery and backup systems, but classical adaptive failure detection algorithms suffer from long failure detection times and high error rates. To address this, the paper proposes λ-FD, an adaptive failure detection algorithm based on the exponential distribution, which combines the Push and Pull heartbeat modes to implement a re-check strategy. Experimental results show that λ-FD performs well with the threshold set to 0.68: the failure detection time is 1339.5 ms and the error rate is 0.0557%, far lower than the 15.19% of the classical φ-FD algorithm and the 24.92% of Chen-FD at the same failure detection time. In general, λ-FD has a lower error rate than the classical adaptive failure detection algorithms at the same failure detection time, and needs a shorter failure detection time at the same error rate. It therefore improves failure detection performance and is better suited to disaster recovery systems deployed over a wide area network (WAN).
Source: Computer Engineering (《计算机工程》), indexed in CAS and CSCD, 2014, No. 3, pp. 303-305, 309 (4 pages)
Funding: National Natural Science Foundation of China (61173159); Ministry of Education Major Project Cultivation Fund (708075)
Keywords: adaptive; failure detection; exponential distribution; disaster recovery; heartbeat; threshold
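The abstract names three ingredients: an exponential model of heartbeat arrivals, a suspicion threshold (0.68 in the reported experiments), and a re-check that combines Push and Pull heartbeats. It does not give the formulas, so the following Python is only a minimal sketch of how such a detector could look, not the authors' λ-FD. The class name `ExpFailureDetector`, the sliding-window mean, the suspicion formula `1 - exp(-elapsed / mean)`, and the `pull_probe` callback are all assumptions made for illustration.

```python
import math
from collections import deque


class ExpFailureDetector:
    """Hypothetical exponential-distribution detector (not the paper's code)."""

    def __init__(self, threshold=0.68, window=100):
        self.threshold = threshold             # suspicion level that triggers a re-check
        self.intervals = deque(maxlen=window)  # recent heartbeat inter-arrival times
        self.last_arrival = None

    def heartbeat(self, now):
        """Record a Push heartbeat that arrived at time `now` (seconds)."""
        if self.last_arrival is not None:
            self.intervals.append(now - self.last_arrival)
        self.last_arrival = now

    def suspicion(self, now):
        """Assumed model: exponential CDF of the time since the last heartbeat."""
        if not self.intervals:
            return 0.0  # not enough history to suspect anything
        mean = sum(self.intervals) / len(self.intervals)
        elapsed = now - self.last_arrival
        if mean <= 0.0:
            return 1.0 if elapsed > 0.0 else 0.0
        return 1.0 - math.exp(-elapsed / mean)

    def is_failed(self, now, pull_probe):
        """Suspect on the threshold, then re-check with a Pull probe.

        `pull_probe` is a caller-supplied function that actively queries the
        monitored node and returns True if it answered.
        """
        if self.suspicion(now) < self.threshold:
            return False
        if pull_probe():
            self.heartbeat(now)  # a reply counts as a fresh heartbeat
            return False
        return True


# Usage: heartbeats every second, then silence.
fd = ExpFailureDetector(threshold=0.68)
for t in (0.0, 1.0, 2.0, 3.0):
    fd.heartbeat(t)
print(fd.is_failed(3.5, lambda: False))  # below threshold, probe not called
print(fd.is_failed(5.0, lambda: False))  # above threshold and probe unanswered
```

Under this assumed model, a threshold of 0.68 is crossed once the silence exceeds about 1.14 times the mean heartbeat interval, since -ln(1 - 0.68) ≈ 1.14. The Pull probe is there so that a late Push heartbeat does not by itself produce a false failure report.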
