期刊文献+

同构Hadoop环境作业执行时间计算方法 被引量:1

Method for computing execution time of jobs in homogeneous hadoop environments
下载PDF
导出
摘要 执行时间是作业调度的重要参考因素之一。通过分析Hadoop MapReduce环境作业的执行特征,提出了以map任务和reduce任务执行时间为输入,估算作业执行时间的方法。该方法在一定假设条件下,借助作业预执行来获取map任务和reduce任务的执行时间。实验结果表明,该方法估算作业执行时间的误差率小于7%。 Execution time is very important for job scheduling. In this paper, the execution characters of Hadoop MapReduce jobs are analyzed, and then a new method is proposed to compute the execution times of these jobs. The method takes the execution times of map task and reduce task as input data. It captures these execution times by pre-executing under an assumption. The method has been evaluated in a Linux cluster, the experiment results show that the method computed the execution times of jobs with the error rate no more than 7%.
出处 《计算机工程与应用》 CSCD 2014年第10期249-252,共4页 Computer Engineering and Applications
基金 国家自然科学基金面上项目(No.51274088) 河南省教育厅项目(No.ITE12103) 河南理工大学矿山信息化省级重点实验室项目(No.KY2012-05) 河南理工大学博士基金项目(No.B2012-099) 河南省基础与前沿技术研究计划项目(No.122300410415)
关键词 HADOOP MAPREDUCE 作业执行时间 调度 Hadoop MapReduce execution time scheduling
  • 相关文献

参考文献15

  • 1Gantz J, Reinsel D.The digital universe decade-are you ready? [EB/OL].[2012-12-26].http://www.emc.eom/coUateral/ demos/microsites/idc-digi-taluniverse/iview.htm. 被引量:1
  • 2Dean J, Ghemawat S.Mapreduce: simplified data processing on large elusters[J].Communications of the ACM,2008, 51: 107-113. 被引量:1
  • 3Ghemawat S, Gobioff H,Leung S.The Google file sys- tem[C]//Proceedings of the 19th ACM Symposium on Operating Systems Principles,Bolton Landing,USA.New York: ACM,2003,37(5) :29-43. 被引量:1
  • 4Apache.Welcome to hadoop mapreduce![EB/OL].[2012-12-26]. http ://hadoop.apache.org/mapreduce/. 被引量:1
  • 5Thusoo A, Sarma J S, Jain N, et al.Hive: a warehousing solution over a Map-Reduce framework[J].Proceedings of the VLDB Endowment,2009,2(2) : 1626-1629. 被引量:1
  • 6Panda B, Herbach J S,Basu S, et al.Planet: massively parallel learning of tree ensembles with MapReduce[J].Pro- ceedings of the VLDB Endowment, 2009,2 (2) : 1426-1437. 被引量:1
  • 7He B,Fang W,Luo Q,et al.Mars:a MapReduce flame- work on graphics processors[C]//Proceedings of Parallel Architectures and Compilation Techniques (PACT).New York: Institute of Electrical and Electronics Engineers lnc, 2008:260-269. 被引量:1
  • 8陶洋,黄涛,唐毅.基于主机负载的任务执行时间预测研究[J].计算机应用,2009,29(10):2617-2619. 被引量:3
  • 9蒋炎华.网格环境下任务的执行时间预测技术研究[J].计算机工程与设计,2011,32(10):3428-3430. 被引量:4
  • 10Domingues P,Marqttes P, Silva L.DGSchedSim:a trace- driven simulator to evaluate scheduling algorithm for desktop denvironments[C]//14th Euro Micro International Conference on Parallel, Distributed and Network-Based Processing.Montbe liardSochaux, France : IEEE, 2006 : 83-90. 被引量:1

二级参考文献27

  • 1REED D A, MENDEC C L. Intelligent monitoring for adaptation in grid applications[ J]. Proceedings of the IEEE, 2005, 93(2) : 426 - 435. 被引量:1
  • 2JAMSHED M, KHALIQUE S, SUGURI H, et al. Grid node monitoring architecture for autonomous resource management[ J]. Broadband Networks, 2005,2:1362 - 1369. 被引量:1
  • 3DINDA P A, O'HALLARON D R. The statistical properties of host load[ C]// LCR 98: Fourth Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers. The Netherlands: IOS Press, 1998:1 -23. 被引量:1
  • 4Load trace archive[ EB/OL]. [ 2009 - 01 - 01 ]. http://www. cs. northwestern. edu/- pdinda/LoadTraces/. 被引量:1
  • 5Load trace playback[ EB/OL]. [ 2009 - 01 - 01 ]. http://www.cs. northwestern. edu/-pdinda/LoadTraces/playload/playload. 被引量:1
  • 6EI-Ghazawi T, Gaj K,Alexandridis N,et al.A performance study of job management systems[J].Concurrency and Computation: P racti c e & Experience, John Wiley & Son, 2004,16( 13): 1229-1246. 被引量:1
  • 7Zhou D,Lo V.Wave scheduler:Scheduling for faster turnaround time in peer-based desktop grid systems[C].Boston,MA,USA: Proc of 11th Workshop on Job Scheduling Strategies for Parallel Processing, Lecture Notes in Computer Science 3834. Berlin: Springer,2005. 被引量:1
  • 8Kondo D,Chien A,Casanova H.Resource management for rapid application turnaround on enterprise desktop grids [C].Proc of Super Computing Conference,2004. 被引量:1
  • 9Kondo D,Chien A,Casanova H.Scheduling task parallel applica- tions for rapid application turnaround on enterprise desktop grids [J].Journal of Grid Computing,2007,5(4):379-405. 被引量:1
  • 10R脚本编程软件[OL],http://www.r-yser.org/.2010. 被引量:1

共引文献7

同被引文献8

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部