
Analysis of Grid Checkpoint Recovery Services and Their Application Programming Interface (Cited by: 1)

Analysis of GridCPR Services & API
Abstract: Checkpointing is a software fault-tolerance mechanism that can be combined with the grid, an emerging class of wide-area distributed system, to better meet the fault-tolerance requirements of grid systems. This paper analyzes in detail the key points of checkpoint rollback-recovery protocols, examines the GridCPR API in data grids, and proposes several improvements aimed at fault detection and fault tolerance in grid systems.
Authors: Zhang Lin (张琳), Yang Jing (杨静)
Source: Journal of Computer Applications (《计算机应用》), CSCD, PKU Core, 2004, No. 7, pp. 16-17, 21 (3 pages)
Keywords: grid computing; checkpoint mechanism; fault tolerance; grid checkpoint recovery API
  • Related literature

References: 6

Second-level references: 1

  • 1. Elnozahy E N. CMU Technical Report CMU-CS-99-148, 1999. Cited by: 1

Co-cited literature: 16

Co-citing literature: 7

  • 1. Li Chunjiang, Xiao Nong, Yang Xuejun. Job checkpointing for computational grids based on job progress description [J]. Computer Engineering (计算机工程), 2005, 31(10): 57-59 (in Chinese). Cited by: 1
  • 2. Foster I. The Grid: A New Infrastructure for 21st Century Science. Physics Today, 2002, 54(2). Cited by: 1
  • 3. GridCPR Working Group. An Architecture for Grid Checkpoint Recovery Services and a GridCPR API. http://www.gridforum.org/meetings/ggf7/drafts/GridCPR001.doc. Cited by: 1
  • 4. G. Allen, T. Dramlitsch, I. Foster, N. Karonis, M. Ripeanu, I. Seidel, and B. Toonen. Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus. In Proceedings of the ACM/IEEE SC2001 Conference, November 10-16, 2001, Denver, Colorado. Cited by: 1
  • 5. DataGrid Project. WPI Document, Job description language. Howto. Version 0.1, September 2001. http://server11.infn.it/workload-grid/docs/Datagrid-01-TEN-0102-02-Document.pdf. Cited by: 1
  • 6. José Carlos Sancho, Fabrizio Petrini, Kei Davis, Roberto Gioiosa, Song Jiang. "Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance," Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05). Cited by: 1
  • 7. J. Duell, P. Hargrove, and E. Roman. "The Design and Implementation of Berkeley Lab's Linux Checkpoint/Restart," 2002 10-13. Cited by: 1

Citing literature: 1
