期刊文献+

Hadoop节点资源参数优化策略 被引量:2

Node Resource Parameter Optimization Strategy for Hadoop
下载PDF
导出
摘要 针对Hadoop平台的节点资源优化问题,提出MapReduce参数优化策略。获取新作业执行时的资源占用特征值,计算其与作业特征库中作业的相对距离,选择相对距离最小作业的配置作为新作业的最优配置,如果获取失败,则以迭代方式获取新作业的最优配置并更新作业特征库。实验结果表明,与默认参数配置相比,该策略能够提高作业执行效率,缩短作业运行时间。 For solving the problem of node resources optimization in Hadoop platform,this paper proposes a MapReduce parameter optimization strategy.When a new job is submitted,it first gets feature value of resource utilization,and then calculates the relative distance with the jobs in the signature database.At last,it selects the configuration of the job with the minimum relative distance as the optimal configuration.If the configuration is not found,it gets optimal configuration by the way of iteration and then updates the feature database.Experimental results show that the proposed strategy can effectively improve the efficiency of job execution and reduce the execution time compared with the default parameter configuration.
出处 《计算机工程》 CAS CSCD 北大核心 2016年第1期1-6,共6页 Computer Engineering
基金 国家自然科学基金资助项目(61272447) 国家科技支撑计划基金资助项目(2012BAH18B05)
关键词 HADOOP集群 MAPREDUCE框架 参数优化 资源利用率 执行效率 特征库 相对距离 Hadoop cluster MapReduce framework parameter optimization resource utilization rate execution efficiency feature database relative distance
  • 相关文献

参考文献15

  • 1黄承真,王雷,刘小龙,况亚萍.Hadoop任务分配策略的改进[J].计算机应用,2013,33(8):2158-2162. 被引量:4
  • 2Hadoop[EB/OL].(2012-12-27).http://hadoop.apache.org/. 被引量:1
  • 3北京百度网讯科技有限公司.百度开放云[EB/OL].[2014-09-10].http://bce.baidu.com/Product/BMR.html. 被引量:1
  • 4The Apache Software Foundation.Applications Powered by Hadoop[EB/OL].(2009-07-01).http://wiki.apache.org/hadoop/PoweredBy. 被引量:1
  • 5Rasooli A,Down D G.COSHH:A Classification and Optimization Based Scheduler for Heterogeneous Hadoop Systems[J].Future Generation Computer Systems,2014,36(3):1-15. 被引量:1
  • 6Nykiel T,Potamias M,Mishra C,et al.MRShare:Sharing Across Multiple Queries in MapReduce[J].Proceedings of the VLDB Endowment,2010,3(1/2):494-505. 被引量:1
  • 7Lim H,Herodotou H,Babu S.Stubby:A Transformation-based Optimizer for MapReduce Workflows[J].Pro-ceedings of the VLDB Endowment,2012,5(11):1196-1207. 被引量:1
  • 8Herodotou H,Dong F,Babu S.No One(Cluster) Size Fits All:Automatic Cluster Sizing for Data-intensive Analy-tics[C]//Proceedings of the 2nd ACM Symposium on Cloud Computing.New York,USA:ACM Press,2011:1-18. 被引量:1
  • 9Intel Corporation.Intel Distribution for Apache Hadoop Software:Optimization and Tuning Guide[EB/OL].(2012-10-11).http://hadoop.intel.com/pdfs/Intel DistributionTuningGuide.pdf. 被引量:1
  • 10Amazon Elastic MapReduce[EB/OL].[2014-09-10].http://aws.amazon.com/elasticmapreduce/. 被引量:1

二级参考文献27

  • 1段凡丁.关于最短路径的SPFA快速算法[J].西南交通大学学报,1994,29(2):207-212. 被引量:57
  • 2ARMBRUST M, FOX A, GRIFFITH R, et al. Above the clouds: a Berkeley view of cloud computing [ J]. Communications of the ACM, 2010, 53(4): 50-58. 被引量:1
  • 3DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters[ J]. Communications of the ACM, 2008, 51(1) : 107 -113. 被引量:1
  • 4ISARD M, BUDIU M, YU Y, et al. Dryad: distributed data-paral- lel programs from sequential building blocks [ C ]// EuroSys '07: Proceedings of the 2007 2nd ACM SIGOPS/EuroSys European Con- ference on Computer Systems. New York: ACM, 2007:59 -72. 被引量:1
  • 5Hadoop [ EB/OL]. [ 2012 - 12 - 27]. http://hadoop, apache. org/. 被引量:1
  • 6THUSOO A, SHAO Z, ANTHONY S, et al. Data warehousing and analytics in-frastructure at Facebook [ C]// SIGMOD '10: Proceed- ings of the 2010 ACM SIGMOD International Conference on Manage- ment of Data. New York: ACM, 2010:1013 - 1020. 被引量:1
  • 7WHITE T. Hadoop: the definitive guide[M]. Sebastopol, CA, USA: O'Reilly Media, 2009. 被引量:1
  • 8Fair Scheduler for Hadoop [ EB/OL]. [2012 - 12 - 10]. http:// Hadoop. apache, org/common/docs/current/Fair_scheduler, html. 被引量:1
  • 9ZAHARIA M, BORTHAKUR D, SARMA J S, et al. Delay schedu- ling: a simple technique for achieving locality and fairness in cluster scheduling [ C]// EuroSys '10: Proceedings of the 5th European Conference on Computer Systems. New York: ACM, 2010: 265- 278. 被引量:1
  • 10Capacity scheduler for Hadoop[ EB/OL]. [ 2012 - 12 - 13]. ht- tp://Hadoop, apache, org/commort/docs/current/Capacity_schedu- ler. html. 被引量:1

共引文献8

同被引文献3

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部