摘要
HBase作为Hadoop分布式开源云数据库因其高可用性等优势越来越受到青睐,但是随着大量数据的注入,HBase对负载的分配状况将直接影响到整个集群的性能优劣.针对原有负载均衡算法在负载分配过程中可能产生的负载严重不均衡问题,通过分析原有算法和问题出现的因素,提出一种基于子表限制的负载均衡改进方法,并通过与不均衡状况下的对比实验,验证改进后的分配方式可以有效利用集群中各个节点的资源,从而提高分布式集群性能.
As the Hadoop distributed open source database, HBase gradually gained a lot of attention due to its high- performance. However, with a large amount of data pouring into HBase, the situation of load distribution by HBase will directly affect the performance of the whole cluster. With the problem of load seriously imbalances in the process of original algorithm, proposed a novel load-balancing approach based on the limited child table, with the analysis of original algorithm and the causes of the problem. The contrast experiment showed that the improved approach of scheduling can make effective use of each node resources in the cluster, thereby improved the performance of the distributed cluster.
出处
《微电子学与计算机》
CSCD
北大核心
2016年第4期125-128,共4页
Microelectronics & Computer
基金
河北省自然科学基金项目(F2015402077)
河北省高等学校科学技术研究重点项目(ZD2014054)
关键词
云计算
HBASE
负载均衡
节点资源
集群性能
cloud computing
HBase
load balancing
node resources
cluster performance