期刊文献+

一种用于重复数据删除的非对称最大值分块算法研究 被引量:1

Research on an asymmetric maximum chunking algorithm for deduplication
下载PDF
导出
摘要 分块是一种将文件划分成更小文件的过程,该方法被广泛应用在重复数据删除系统中。针对传统的基于内容分块(CDC)中面临的高额计算开销问题,提出了一种称为非对称最大值的分块算法(CAAM)。采用字节值代替哈希值来声明切点,利用固定大小窗口和可变大小窗口来查找作为切点的最大值,并且允许在保留内容定义分块(CDC)属性的同时进行较少的计算开销。最后将CAAM与现有的基于散列和无哈希的分块算法进行了比较,实验结果表明,CAAM算法比其他算法具有更低的计算开销和更高的分块吞吐量。 bChunking is a process of spilting files into smaller files,which is widely used in deduplication systems. Aiming at the problem of high computational overhead in traditional content-based chunking( CDC),a new chunking algorithm called asymmetric maximum( CAAM) is proposed. Instead of using hashes,CAAM uses the byte value to declare the cut points. The algorithm utilizes the fixed size window and the variable size window to find the maximum value which is cut points. The algorithm allows less computation overhead while keeping the CDC property. Finally,the CAAM algorithm is compared with the existing hash-based and hash-less. The experimental results show that the CAAM algorithm has lower computational cost and higher chunking throughput than other algorithms.
出处 《微型机与应用》 2017年第22期30-33,共4页 Microcomputer & Its Applications
关键词 重复数据删除 非对称窗口 内容定义分块 无哈希分块 切点 deduplication asymmetric window content-defined chunking hash-less chunking cut points
  • 相关文献

参考文献4

二级参考文献61

  • 1Tolia N, Kozuch M, Satyanarayanan M, et al. Opportunistic use of content addressable storage for distributed file systems[C]//Proc Usenix Annual Technical Conference. USENIX Assoc, 2003: 127-140. 被引量:1
  • 2Quinlan S, Dorward S. Venti: A new approach to archival storage[C]// Proc Conf File and Storage Technologies. USENIX Assoc, 2002:89 - 102. 被引量:1
  • 3Muthitacharoen A, CHEN Benjie, Mazieres D, et al. A low bandwidth network file system[C]// Proc 18th ACM Syrup Operat Syst Principles. New York:ACM Press, 2001: 174 - 187. 被引量:1
  • 4Rabin M O. Fingerprinting by random polynomials, TR -15- 81[R]. Boston: Center for Research in Computing Technology, Harvard University, 1981. 被引量:1
  • 5LIU Chuanyi, LU Yingping, DU David, et al. ADMAD: Application-driven metadata aware de duplication archival storage system [C]// Proc 25th IEEG Conf Mass Storage Syst and Technol. Piseataway, NJ: IEEE Press, 2008:29 - 35. 被引量:1
  • 6Wylie J J, Swaminathan R. Determining fault tolerance of XOR-based erasure codes efficiently [C]// Proc 37th Annual IEEE/IFIP Int Conf Dependable Syst Networks. Piscataway, NJ: IEEE Press, 2007: 206-215. 被引量:1
  • 7Peterson W W, Weldon E J. Error Correcting Codes, 2nd Edition[M]. Cambridge: MIT Press, 1972. 被引量:1
  • 8MacWilliams F J, Sloane N J A. The Theory of Error-Correcting Codes, Part I[M]. Amsterdam: North-Holland, 1977. 被引量:1
  • 9Luby M G, Mitzenmacher M, Shokrollahi M A, et al. Efficient erasure correcting codes [J]. IEEE Trans Info Theo, 2001, 47(2): 569-584. 被引量:1
  • 10Ghemawat S, Gobioff H, Leung S T. The Google file system [C]// Proc ACM SIGOPS Operating Systems Review. New York; ACM Press, 2003; 29-43. 被引量:1

共引文献8

同被引文献13

引证文献1

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部