期刊文献+

海量文件系统中基于特征实现文件多维度浏览 被引量:2

Multi-dimension browsing based on features in massive file system
下载PDF
导出
摘要 SMDFS可以高效地管理百亿级数量文件。然而针对照片、音乐等海量数据,往往需要从多个维度快速浏览文件,基于目录结构管理海量文件的传统文件组织方式很难满足这一要求。在SMDFS文件系统基础之上,为文件引入特征属性,并提出基于特征的海量小文件倒排索引技术和分布索引技术,使SMDFS可根据多个特征快速浏览文件。实验数据表明,支持特征的SMDFS能为海量小文件提供高效管理和多维度快速浏览能力,同时基于文件目录结构访问海量小文件的性能并没有明显下降。 The small files distributed file system (SMDFS) can efficiently manage ten billions of files. However, a huge amount of data such as photos, music, etc. , often needs to quickly browse files from multiple dimensions, and traditional files organization schemes based on the directory structure to manage massive files cannot easily meet this requirement. Based on the SMDFS file system, we intro- duce features to file attributes and put forward a feature-based massive small files inverted indexing tech- nique and a distributed indexing technique, which enables the SMDFS browse files based on multiple fea- tures. Experimental results show that the feature-supported SMDFS can provide efficient management and rapid multi-dimensional browsing capability for massive small files while the massive small files ac- cess performance based on file-directory structure is not significantly decreased.
出处 《计算机工程与科学》 CSCD 北大核心 2017年第5期849-854,共6页 Computer Engineering & Science
关键词 海量小文件 检索 倒排索引 动态重构 massive small files search inverted index dynamic reconstruction
  • 相关文献

参考文献1

二级参考文献11

  • 1GHEMAWAT S,GOBIOFF H,LEUNG S-T.The Google file system. Proceedings of the 19th ACM Symposium on Operating Sys-tems Principles . 2003 被引量:3
  • 2White T.The Small Files Problem. Cloudera . 2009 被引量:1
  • 3X. Liu,J. Han,Y. Zhong,C. Han etal.'Implementing WebGIS on Hadoop: A case study ofimproving small file I/O performance on HDFS,'. Proceedings of t he2009IEEE InternationalConference on Cluster Computing . 2009 被引量:1
  • 4Beaver D,Kumar S,Li H C, et al.Finding a needle in Haystack: facebook’’s photo storage. Proceedings of the9th USENIX conference on Operating systems design and implementa-tion . 2010 被引量:1
  • 5Mackey G,Sehrish S,Wang J.Improving metadata management for small files in HDFS. Proc of the 2009IEEE Int Conf on Cluster Computing(CLUSTER′09) . 2009 被引量:1
  • 6The Apache Hadoop project.Welcome to Apache Hadoop. http://hadoop.apache.org . 2015 被引量:1
  • 7AMAZON.Amazon simple storage service. http://www.amazon.com/s3 . 2015 被引量:1
  • 8McKusick M K,Quinlan S.GFS:Evolution on fast forward. http://queue.acre.org/detail.cfm/id=1594206 . 2015 被引量:1
  • 9Chu Yu.Taobao file system. http://code.taobao.org/p/tfs/-wiki/index . 2015 被引量:1
  • 10Java Platform SE 6-ConcurrentSkipListMap. http://docs.oracle.com/javase/6/docs/api/java/util/concurrent/ConcurrentSkiConcurre.html . 2015 被引量:1

共引文献3

同被引文献16

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部