期刊文献+

分布式存储系统中的低修复成本纠删码 被引量:6

Erasure code with low recovery-overhead in distributed storage systems
下载PDF
导出
摘要 纠删码技术是分布式存储系统中典型的数据容错方法,与多副本技术相比,能够以较低的存储开销提供较高的数据可靠性;然而,纠删码修复成本过高的特点限制了其应用。针对现有纠删码修复成本高、编码复杂和灵活性差的问题,提出一种编码简单的低修复成本的纠删码——旋转分组修复码(RGRC)。RGRC首先将多个条带组合成条带集,然后利用条带之间的关联关系对条带集内的数据块进行分层旋转编码,以此得到相应的冗余块。RGRC大幅度地减少了单节点修复过程中所需要读取和传输的数据量,从而能节省大量的网络带宽资源。同时RGRC在解决单节点修复成本高的问题时,依然保留着较高的容错能力,且为满足分布式存储系统的不同需求,可以灵活地权衡系统的存储开销和修复成本。在分布式存储系统中进行的对比实验分析结果展示,与其他常用的RS(Reed-Solomon)码、LRC(Locally Repairable Codes)、basic-Pyramid、DLRC(Dynamic Local Reconstruction Codes)、pLRC(proactive Locally Repairable Codes)、GRC(Group Repairable Codes)、UFP-LRC(Unequal Failure Protection based Local Reconstruction Codes)相比,RGRC只需要增加少量的存储开销,就能降低单节点修复14%~61%的修复成本,同时减少14%~58%的修复时间。 Erasure code technology is a typical data fault tolerance method in distributed storage systems.Compared with multi-copy technology,it can provide high data reliability with low storage overhead.However,the high repair cost limits the practical applications of erasure code technology.Aiming at problems of high repair cost,complex encoding and poor flexibility of existing erasure codes,a simple-encoding erasure code with low repair cost—Rotation Group Repairable Code(RGRC)was proposed.According to RGRC,multiple strips were combined into a strip set at first.After that,the association relationship between the strips was used to hierarchically rotate and encode the data blocks in the strip set to obtain the corresponding redundant blocks.RGRC greatly reduced the amount of data needed to be read and transmitted in the process of single-node repair,thus saving a lot of network bandwidth resources.Meanwhile,RGRC still retained high fault tolerance when solving the problem of high repair cost of a single node.And,in order to meet the different needs of distributed storage systems,RGRC was able to flexibly weigh the storage overhead and repair cost of the system.Comparison experiments were conducted on a distributed storage system,the experimental analysis shows that compared with RS(Reed-Solomon)codes,LRC(Locally Repairable Codes),basic-Pyramid,DLRC(Dynamic Local Reconstruction Codes),pLRC(proactive Locally Repairable Codes),GRC(Group Repairable Codes)and UFP-LRC(Unequal Failure Protection based Local Reconstruction Codes),RGRC can reduce the repair cost of single node repair by 14%-61%through adding a small amount of storage overhead,and reduces the repair time by 14%-58%.
作者 张航 刘善政 唐聃 蔡红亮 ZHANG Hang;LIU Shanzheng;TANG Dan;CAI Hongliang(School of Software Engineering,Chengdu University of Information Technology,Chengdu Sichuan 610225,China)
出处 《计算机应用》 CSCD 北大核心 2020年第10期2942-2950,共9页 journal of Computer Applications
基金 四川省科技计划项目(2020YFG0150) 四川人工智能重大专项(2018GZDZX0030) 四川省科技成果转移转化示范项目(2018CC0093)。
关键词 分布式存储系统 数据修复 单节点修复 纠删码 低修复成本 distributed storage system data recovery single-node repair erasure code low repair cost
  • 相关文献

参考文献6

二级参考文献24

  • 1周松,王意洁.EXPyramid:一种灵活的基于阵列结构的高容错低修复成本编码方案[J].计算机研究与发展,2011,48(S1):30-36. 被引量:5
  • 2亚马逊Web服务[EB/OL].[2013-05-02].http://aws.amazon.com. 被引量:1
  • 3Hamilton J.An Architecture for Modular Data Centers. Conf on Innovative Data Systems Research(CIDR2007) . 2007 被引量:1
  • 4Huang C,Xu L.Star:An efficient coding scheme for correcting triple storage node failures. Proc of the4th USENIX Conf on File and Storage Technologies (FAST’’05) . 2005 被引量:1
  • 5Plank J S.The RAID-6liberation codes. Proc of the6th USENIX Conf on File and Storage Technologies (FAST’’08) . 2008 被引量:1
  • 6Ghemawat S,Gobioff H,Leung S T.The Google file system. Proc of the19th ACM Symp on Operating Systems Principles(SOSP2003) . 2003 被引量:1
  • 7Zhang Zhe,Deshpande Amey,Ma Xiaosong,et al.Does erasure coding have a role to play in my data center. Microsoft Research Technical Report MSR-TR-2010-52 . 2010 被引量:1
  • 8Huang Cheng,Chen Minghua,Li Jin.Pyramid codes:Flexible schemes to trade space for access efficiency in reliable data storage systems. Proc of the6th IEEE Int Symp on Network Computing and Applications(NCA2007) . 2007 被引量:1
  • 9Dimakis A G,Godfrey P B,Wu Y,et al.Network coding for distributed storage systems. Proc of the26th IEEE Conf on Computer Communications . 2007 被引量:1
  • 10Wu Y,Dimakis A.Reducing repair traffic for erasure codingbased storage via interference alignment. Proc of ISIT . 2009 被引量:1

共引文献344

同被引文献49

引证文献6

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部