
Realization of Load Balancing Mechanism in Storm Streaming Processing Platform (cited by: 7)
Abstract: The Storm stream processing platform addresses the poor real-time performance of traditional Hadoop-based batch processing systems and provides an efficient, fast, real-time framework for processing multi-source heterogeneous big data. However, when assigning tasks, Storm considers only the ordering of available Slots across nodes and does not take the actual load of each node into account, so load imbalance can easily arise, particularly when more than one topology is running. To address this, the paper improves the Storm scheduling algorithm on the Storm distributed stream processing system by sorting nodes on a weighted combination of available Slots and node load. In addition, the data structure is designed so that HBase rowkeys are random and unique, which keeps the load across RegionServers balanced, and a batch-write mechanism increases HBase write speed and thus the efficiency of storing stream data. Comparison experiments against native Storm show that the improved algorithm and the optimized mechanisms ensure fast data writes and raise cluster resource utilization, and that the improved system has clear advantages in practicality and efficiency.
Source: Computer and Modernization (《计算机与现代化》), 2017, No. 12, pp. 65-70, 76 (7 pages)
Funding: Self-initiated projects of the 32nd Research Institute of China Electronics Technology Group Corporation (ZQ160006, ZQ160007)
Keywords: Storm; stream processing; distributed computing; batch processing; load balancing
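The abstract names three mechanisms without giving formulas: weighted sorting of nodes by available Slots and load, rowkeys that are random yet unique, and batched writes. The following is a minimal Python sketch of those ideas, not the paper's implementation. The `Node` fields, the equal weights, the linear score, the hash-derived rowkey prefix, the bucket count, and all function names are assumptions introduced for illustration only.

```python
import hashlib
from dataclasses import dataclass
from itertools import islice


@dataclass
class Node:
    name: str
    free_slots: int   # available Slots on this node
    total_slots: int  # total Slots configured on this node
    load: float       # normalized load in [0, 1] (hypothetical metric, e.g. CPU use)


def rank_nodes(nodes, w_slots=0.5, w_load=0.5):
    """Order nodes best-first by a weighted score (assumed linear form)."""
    def score(n):
        slot_ratio = n.free_slots / n.total_slots if n.total_slots else 0.0
        # More free Slots and lower load both raise the score.
        return w_slots * slot_ratio + w_load * (1.0 - n.load)

    # A node with no free Slot cannot host a worker, so it is skipped.
    candidates = [n for n in nodes if n.free_slots > 0]
    return sorted(candidates, key=score, reverse=True)


def make_rowkey(source_id, timestamp_ms, seq, buckets=16):
    """Build a rowkey whose hash-derived prefix spreads keys over buckets.

    The body (source, timestamp, sequence) keeps the key unique as long as
    that triple is unique; the prefix is pseudo-random but reproducible.
    """
    body = f"{source_id}|{timestamp_ms}|{seq}"
    digest = hashlib.md5(body.encode("utf-8")).hexdigest()
    salt = int(digest[:8], 16) % buckets
    return f"{salt:02d}|{body}"


def batched(records, size):
    """Yield lists of up to `size` records, one list per batched write."""
    it = iter(records)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk
```

With equal weights, a node with all Slots free but 90% load ranks below a half-free node at 10% load, which an ordering on available Slots alone would not do. In a real deployment each list yielded by `batched` would be handed to the storage client as one write request; that call is left out here because the abstract does not describe it.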
