
Adaptive Optimization in Small Size File Transmission of Massive Meteorological Data (cited by: 6)
Abstract To meet the need for timely transmission of large numbers of small meteorological data files in real-time meteorological data transfer, the data transmission service is optimized and an adaptive data transmission optimization method based on real-time network conditions is proposed. The method optimizes the network transmission protocol and uses file compression; by obtaining real-time parameters of the network link, it adjusts the compression parameters and network transmission parameters in real time to optimize transmission performance. For adaptive compression, experimental analysis and induction establish that a small meteorological data file is one smaller than 50 KB, and an algorithm is designed that adapts the compression level to real-time network conditions. For adaptive tuning of transmission parameters, the importance of the TCP buffer size and the number of concurrent TCP connections in the GridFTP protocol is studied, and algorithms are designed that adapt each of them to real-time network conditions; these algorithms improve transmission performance by 65%. Experimental validation of the proposed adaptive parameter adjustment algorithms shows that the adaptive tuning method combining compression and network transmission improves the transmission performance of small meteorological data files by nearly 500 times.
The data transfer and service architecture constructed by the National Meteorological Information Center is the foundation for most meteorological data transmission. How to improve the timeliness of transmission of various data is a hot topic in enhancing the capabilities of meteorological services. According to the transmission performance requirements of massive small files, transmission parameters are optimized, and a self-adapting data transmission method is proposed based on real-time network status, which emphasizes the network transmission protocol and file compression. Compression parameters and network transmission parameters are adjusted during real-time operation. Meteorological data include a great number of heterogeneous small files, so compressing small files into one big file when they are transferred effectively reduces I/O accesses. First, 50 KB is defined through experiments as the threshold for small meteorological data files. Then, by analyzing the file transfer time, the appropriate number of files per compressed package is calculated to achieve the best transmission efficiency. Finally, considering the variability of network conditions, a self-adapting compression method based on real-time network conditions is designed that adjusts the compression level in real time. The entire compression process is controlled by setting various parameters of lzop commands on the basis of the lzop algorithm library and the LZO algorithm. To adjust compression levels according to real-time network conditions, RTT (round-trip time) is used to judge the current state of network congestion. By comparing the current RTT with the previous RTT, it is decided whether to change the compression level. In network transmission optimization, experiments on the Globus platform lead to the conclusion that TCP buffers and parallel transmission consume memory resources. At the same time, more parallel streams and larger TCP buffers result in network congestion. Then, the self-adapting adjust
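Source Journal of Applied Meteorological Science (《应用气象学报》), CSCD, Peking University Core Journal, 2014, No. 5, pp. 629-637 (9 pages)
Funding Jiangsu Provincial Department of Education Qinglan Project for Young and Middle-Aged Academic Leaders, "Cloud-Computing-Based Meteorological Data Sharing Platform"; Jiangsu Province "Six Talent Peaks" High-Level Talent Project (2012WLW-022)
Keywords meteorological data; small files; compression; transmission optimization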
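The abstract describes deciding whether to change the compression level by comparing the current RTT with the previous RTT, and treating files under 50 KB as small files to be packed together. The following is only a minimal sketch of that idea, not the authors' algorithm. The direction of the adjustment (compress harder when RTT rises), the 10% tolerance, and the names `next_level`, `is_small_file`, and `compress_batch` are all assumptions made for illustration. Python's `zlib` stands in for lzop/LZO, which the paper actually uses.

```python
import zlib

SMALL_FILE_LIMIT = 50 * 1024  # 50 KB small-file threshold stated in the abstract
MIN_LEVEL, MAX_LEVEL = 1, 9   # zlib levels, used here as a stand-in for lzop levels


def next_level(level, prev_rtt_ms, cur_rtt_ms, tolerance=0.1):
    """Return the compression level to use for the next package.

    Assumed rule (not taken from the paper): if RTT rose by more than
    `tolerance`, the link looks congested, so compress harder to send
    fewer bytes; if RTT fell by more than `tolerance`, compress less to
    save CPU time; otherwise keep the current level.
    """
    if cur_rtt_ms > prev_rtt_ms * (1 + tolerance):
        return min(level + 1, MAX_LEVEL)
    if cur_rtt_ms < prev_rtt_ms * (1 - tolerance):
        return max(level - 1, MIN_LEVEL)
    return level


def is_small_file(size_bytes):
    """True when a file counts as 'small' under the 50 KB threshold."""
    return size_bytes < SMALL_FILE_LIMIT


def compress_batch(payloads, level):
    """Pack several small payloads into one compressed package.

    File boundaries are not preserved in this sketch; a real packer
    would store names and lengths (for example with tar).
    """
    return zlib.compress(b"".join(payloads), level)
```

A sender could call `next_level` once per package with the two most recent RTT samples and pass the result to `compress_batch`, so the level follows the measured network state rather than a fixed setting.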
