Abstract
Failure detection is one of the key techniques for ensuring the high availability of disaster recovery and backup systems, but classical adaptive failure detection algorithms suffer from long failure detection times and high error rates. To address this, this paper proposes λ-FD, an adaptive failure detection algorithm based on the exponential distribution, which combines the Push and Pull heartbeat modes to implement a re-check strategy. Experimental results show that λ-FD performs well with the threshold set to 0.68: its failure detection time is 1339.5 ms and its error rate is 0.0557%, far below the 15.19% of φ-FD and the 24.92% of Chen-FD at the same failure detection time. At equal failure detection time, the error rate of λ-FD is generally lower than that of the classical adaptive failure detection algorithms, and at equal error rate it needs less failure detection time. λ-FD thus effectively improves failure detection performance and better suits disaster recovery systems in a Wide Area Network (WAN).
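The abstract does not give the formulas of λ-FD, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that heartbeat inter-arrival times are modeled as exponentially distributed with a mean estimated over a sliding window, that the suspicion level is the exponential CDF of the time elapsed since the last Push heartbeat, and that crossing the threshold triggers a single Pull probe as the re-check. The class name `ExpFailureDetector`, the window size, and the `pull_probe` callback are hypothetical.

```python
import math
from collections import deque


class ExpFailureDetector:
    """Illustrative exponential-distribution detector (assumed design, not the paper's code)."""

    def __init__(self, threshold=0.68, window=100):
        self.threshold = threshold             # 0.68 is the value reported in the abstract
        self.intervals = deque(maxlen=window)  # sliding window of inter-arrival times (assumed)
        self.last_arrival = None

    def heartbeat(self, now):
        """Record a Push heartbeat that arrived at time `now` (milliseconds)."""
        if self.last_arrival is not None:
            self.intervals.append(now - self.last_arrival)
        self.last_arrival = now

    def suspicion(self, now):
        """Exponential CDF of the time elapsed since the last heartbeat."""
        if not self.intervals:
            return 0.0  # not enough history to estimate a mean yet
        mean = sum(self.intervals) / len(self.intervals)
        if mean <= 0:
            return 0.0
        elapsed = max(0.0, now - self.last_arrival)
        return 1.0 - math.exp(-elapsed / mean)

    def check(self, now, pull_probe):
        """Return True if the monitored node is judged failed at time `now`.

        `pull_probe` is a hypothetical callable that sends a Pull request
        and returns True when the node answers.
        """
        if self.suspicion(now) < self.threshold:
            return False
        # Re-check: one Pull probe before declaring a failure.
        return not pull_probe()


fd = ExpFailureDetector()
for t in (0, 1000, 2000):
    fd.heartbeat(t)
alive_after_probe = fd.check(3200, pull_probe=lambda: True)
failed_after_probe = fd.check(3200, pull_probe=lambda: False)
```

Under these assumptions, a threshold of 0.68 is crossed once the silence exceeds about 1.14 times the mean inter-arrival time, since -ln(1 - 0.68) ≈ 1.139. The Pull probe is what keeps a single late heartbeat from being counted as a failure.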
Source
《计算机工程》
CAS
CSCD
2014, No. 3, pp. 303-305, 309 (4 pages)
Computer Engineering
Funding
National Natural Science Foundation of China (61173159)
Major Project Cultivation Fund of the Ministry of Education of China (708075)
Keywords
adaptive
failure detection
exponential distribution
disaster recovery
heartbeat
threshold