基于匹配规则的MapReduce任务调度模型被引量：7

MapReduce tasks scheduling model based on matching rules

下载PDF

导出

摘要基于开源云计算平台Hadoop的MapReduce是当前流行的分布式计算框架之一,然而其先进先出(FIFO)调度算法存在资源利用效率低下的问题。提出了一种基于资源匹配规则的MapReduce任务调度模型并进行了算法实现。该调度模型通过获取任务的资源需求与计算节点的剩余资源,依据资源的匹配性进行任务分配,提高了系统的资源使用效率。首先对MapReduce的调度过程进行建模,提出了资源及匹配度的量化定义和相应的计算公式;然后给出了资源测量的具体方法及算法实现;最后利用TeraSort、GrepCount和WordCount任务与FIFO调度算法进行实验对比,实验结果显示,最好的情况下,提出的调度模型任务完成时间减少了22.19%,而最差情况下的吞吐量也提高了25.39%。 MapReduce is one of the popular distributed computing frameworks based on an open source cloud platform named Hadoop.However,the First-In First-Out （FIFO） scheduling algorithm of MapReduce is inefficient in resources utilization.A new tasks scheduling model based on resources matching rules was proposed and implemented.After obtaining the tasks resources requirement and remainder resources on computing nodes,the model assigned tasks to computing nodes based on resources matching degree to improve the usage efficiency of system resources.First of all,the model for MapReduce scheduling was established,the quantitative definition of resources and matching degree were given,and the corresponding calculation formulas were put forward.Second,the specific methods of resource measurement and the implementation of the algorithm were introduced.Compared with FIFO scheduling algorithm on TeraSort,GrepCount and WordCount,the experimental results show that the proposed model reduces by 22.19％ in tasks completion time in the best case,and increases by 25.39％ in throughput even in the worst case.

作者金伟健王春枝

机构地区义乌工商职业技术学院机电信息分院湖北工业大学计算机学院

出处《计算机应用》 CSCD 北大核心 2014年第4期1010-1013,1018,共5页 journal of Computer Applications

基金国家自然科学基金项目资助项目(61170135)

关键词云计算调度算法 HADOOP MAPREDUCE 先进先出 cloud computing scheduling algorithm Hadoop MapReduce First-In First-Out （FIFO）

分类号 TP302.1 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献13

1VAQUERO L, RODERO-MERINO L, CACERES J, et al. A break in the clouds: towards a cloud definition [ J]. ACM SIGCOMM Computer Communication Review, 2008, 39(1) : 50 -55. 被引量：1
2GHEMAWAT S, GOBIOFF H, LEUNG S-T. The Google file system [ C]//SOSP '03: Proceedings of the 19th ACM Symposium on Op- erating Systems Principles. New York: ACM, 2003:29 -43. 被引量：1
3DEAN J, GHEMAWAT S. MapReduce: Simplified data processing on large clusters [J]. Communications of the ACM, 2008, 51( 1): 107 - 113. 被引量：1
4BURROWS M. The Chubby lock service for loosely-coupled distrib- uted systems [ C]//OSDI '06: Proceedings of the 7th Symposium on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2006:335-350. 被引量：1
5BERLI!)ISKA J, DROZDOWSKI M. Scheduling divisible MapRe- duce computations [ J]. Journal of Parallel and Distributed Compu- ting, 2011, 71(3):450-459. 被引量：1
6SANDHOLM T, LAI K. Dynamic proportional share scheduling in Hadoop [ C]// JSSPP '10: Proceedings of the 15th International Conference on Job Scheduling Strategies for Parallel Processing, LNCS 6253. Berlin: Springer, 2010:110-131. 被引量：1
7ZAHARIA M, BORTHAKUR D, SARMA J S, et al. Job schedu- ling for multi-user MapReduce clusters, UCB/EECS-2009-55 [ R]. Berkeley: University of California, 2009. 被引量：1
8DOELITZSCHER F, SULISTIO A, REICH C, et al. Private cloud for collaboration and e-Learning services: from IaaS to SaaS [ J]. Computing, 2011, 91(1) : 23 -42. 被引量：1
9O'MALLEY O. Terabyte sort on apache Hadoop [ EB/OL]. [ 2011O'MALLEY O. Terabyte sort on apache Hadoop [ EB/OL]. [ 2013-O1 - 08]. http://citeseerx, ist. psu. edu/viewdoc/download? doi = 10.1.1. 178. l187&rep = repl &type = pdf. 被引量：1
10O'MALLEY OMURTHY A C. Winning a 60 second dash with a yellow elephant [ EB/OL]. [2013 -02 - 13]. http://mfotos, n/- NZc29ydGJlbmNobWFyaySvcmc. ZN-Yahoo2009. pdf. 被引量：1

同被引文献71

1Apache Hadoop I EB/OL ]. ( 20~4-03-11 ). http ://hadoop. apache, org/. 被引量：1
2Azzedin F. Towards a Scalable HDFS Architec- ture [ C ]//Proceedings of 2013 International Conference on Cc~l, laboration Technologies and Systems. New York, USA : ACM Press ,2013 : 155 -161. 被引量：1
3Zaharia M,Konwinski A, Joseph A D, et al. Improving MapReduce Performance in Heterogeneous Environ- mentsICl// Proceedings of the 8th Symposium on Operating Systems' Design and Implementation. New York, USA .. ACM Press ,2008. 被引量：1
4Zaharia M, Borthakur D, Sarma J S, et al. Delay Scheduling:A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling~ C ~//Proceedings of the 5th European Conference on Computer Systems. New York, USA : ACM Press : 2010 : 265 -278. 被引量：1
5Chen Quan, Zhang Daqiang, Guo Minyi, et al. SAMR : A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment ~ C 1//Proceedings of IEEE International Conference on Computer and Information Technology. Washington D. C. , USA : IEEE Press ,2010 : 2736-2743. 被引量：1
6Sandholm T, Lai K. Dynamic Proportional Share Scheduling in Hadoop I C ]//Proceedings of the 15th Workshop on Job Scheduling Strategies for Parallel Processing. New York ,USA :ACM Press ,2010 : 110-131. 被引量：1
7K.amal K,Anyanwu K. Scheduling Hadoop Jobs to Meet Deadlines E C]//Proceedings of the 2nd IEEE Inter- national Conference on Cloud Computing Technology and Science. Washington D. C., USA: IEEE Press, 2010:388-392. 被引量：1
8Yong M,Garegrat N, Mohan S S. Towards a Resource Aware Scheduler in Hadoop I C l//Proceedings of ICWS' 09. Washington D. C. , USA : IEEE Press, 2009 : 102-109. 被引量：1
9Patil S, Deshmukh S. Survey on Task AssignmentTechniques in Hadoop [ J 1. International Journal of Computer Applications ,2012,59 ( 14 ) : 15-18. 被引量：1
10Dhok J, Maheshwari N, Varma V. Learning Based Opportunistic Admission Control Algorithm for MapReduce as a Service [ C l//Proceedings of the 3rd India Software Engineering Conference. New York, USA: ACM Press, 2010:153-160. 被引量：1

引证文献7

1徐化祥,陈林,陈浩.开源IaaS平台中的随机调度任务动态融合仿真[J].计算机仿真,2015,32(7):386-389.
2薛涛,燕明磊.基于CHC遗传算法的Hadoop作业调度研究[J].计算机工程,2016,42(3):61-68. 被引量：2
3程邺华.云平台的任务合理化调度模型仿真分析[J].计算机仿真,2016,33(5):376-379.
4黄伟建,杨海龙.基于Dijkstra算法分布式JobTracker节点模型通信方式的优化[J].河南师范大学学报（自然科学版）,2016,44(3):154-159.
5苑中梁,陈兴蜀,王毅桐.IaaS环境下多租户安全资源分配算法和安全服务调度框架[J].计算机应用,2017,37(2):383-387. 被引量：5
6李慧芳,刘秀平.凸价格函数下基于堆栈执行的云计算资源调度方案[J].计算机应用研究,2017,34(10):3129-3132. 被引量：2
7王钟斐,王钟磊.一种改进的延时调度算法[J].电子设计工程,2018,26(15):23-26.

二级引证文献9

1王丽红,夏魁良,金丹.求解Hadoop作业调度问题的混合遗传算法[J].齐齐哈尔大学学报（自然科学版）,2018,34(3):6-10.
2杨梅芳.云计算模式下多租户软件最优调度方法仿真[J].计算机仿真,2018,35(7):447-451.
3王钟斐,王钟磊.一种改进的延时调度算法[J].电子设计工程,2018,26(15):23-26.
4周三玲,童一飞.物流制造产品按需求资源优化分配仿真[J].计算机仿真,2018,35(8):420-423. 被引量：1
5武慧虹,钱淑渠.改进的非传统遗传算法用于逆变器PWM控制[J].吉林师范大学学报（自然科学版）,2018,39(3):123-129. 被引量：1
6王锋,张璟,张彤,马维纲.基于Eucalyptus的多租户水利系统应用研究[J].计算机系统应用,2018,27(9):107-111.
7臧艳辉,赵雪章,席运江.基于MF-R和AWS密钥管理机制的物联网健康监测大数据分析系统[J].计算机应用研究,2019,36(7):2065-2069. 被引量：11
8庄小叶,李轲.云资源分配优化的非合作博弈定价机制[J].新乡学院学报,2021,38(3):48-52. 被引量：2
9闫春荣,牟宏蕾,郝亚飞,孟凡博.IT支撑系统多租户模式应用研究[J].通信管理与技术,2021(5):41-44. 被引量：1

1夏建兵,周永塔.基于数据交换的数据整合及其应用探讨[J].电子测试,2013,24(3):159-159. 被引量：1
2江楠.浅谈服务器虚拟化核心技术[J].数字技术与应用,2014,32(11):223-223. 被引量：2
3苏蕊,徐炜民,钱晓竞.基于双向匹配模型的任务调度策略的研究[J].计算机工程与设计,2005,26(8):2045-2047. 被引量：4
4柯汉松.高效使用INTERNET资源的捷径[J].韩山师范学院学报,1998,19(2):83-85.
5卢清,李春生,尹义方.一种面向服务的云计算数字化校园架构模型[J].长江大学学报（自科版）（上旬）,2012,9(5):142-144.
6古俐明.集群服务器负载均衡技术研究[J].微计算机信息,2007,23(04X):112-114. 被引量：15
7廖珊.基于桌面云服务的计算机实验管理研究[J].中国新通信,2017,19(3):67-68.
8信息中心云自动化管理系统[J].金融电子化,2014(9):75-76.
9张千,牛伟伟,邢常振,梁鸿.一种基于DAG图划分的网格关联任务调度算法[J].小型微型计算机系统,2012,33(5):971-975. 被引量：2
10朴杰,朱云龙.网格技术在虚拟企业中的应用研究[J].计算机集成制造系统,2005,11(7):1019-1024. 被引量：11

计算机应用

2014年第4期

浏览历史

内容加载中请稍等...

基于匹配规则的MapReduce任务调度模型被引量：7

参考文献13

同被引文献71

引证文献7

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

基于匹配规则的MapReduce任务调度模型 被引量：7

参考文献13

同被引文献71

引证文献7

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

基于匹配规则的MapReduce任务调度模型被引量：7