Abstract
Edge computing (EC) deploys computing and storage resources at the network edge to meet application requirements on latency and energy consumption. Computation offloading is one of the key technologies in EC. When estimating task queuing delay, existing computation offloading methods use the M/M/1/∞/∞/FCFS or M/M/n/∞/∞/FCFS queuing model and do not give priority to highly delay-sensitive tasks, so delay-insensitive tasks may occupy computing resources for long periods, greatly increasing the delay cost of the system. In addition, most existing experience replay methods use random sampling, which cannot distinguish good experiences from poor ones, resulting in low experience utilization and slow convergence of the neural network. Moreover, computation offloading methods based on deterministic-policy deep reinforcement learning (DRL) suffer from weak exploration of the environment and low robustness, which reduces the accuracy of the solutions to the computation offloading problem. To solve these problems, this paper considers a computation offloading scenario with multi-task mobile devices and multiple edge servers, studies the task scheduling and offloading decision problems with the goal of minimizing the joint delay and energy-consumption cost of the system, and proposes COURIER (Computation Offloading qUeuing pRioritIzed Experience Replay DRL), a computation offloading method based on non-preemptive priority queuing and prioritized experience replay DRL. For task scheduling, COURIER designs a non-preemptive priority queuing model (M/M/n/∞/∞/NPR) to optimize the queuing delay of tasks. For offloading decisions, it proposes a prioritized-experience-replay offloading decision mechanism based on Soft Actor-Critic (SAC): an information-entropy term is added to the objective function so that the agent adopts a stochastic policy, and the experience sampling scheme of the mechanism is optimized to accelerate the convergence of the network. Simulation results show that COURIER can effectively reduce the joint delay and energy-consumption cost of EC systems.
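To make the queuing model concrete: an M/M/n/∞/∞/NPR queue with a common exponential service rate admits a standard closed form for per-class mean waiting time (Cobham's formula generalized to n servers via the Erlang-C probability). The sketch below is illustrative and not code from the paper; the function name npr_mean_waits and its parameters are invented for this example.

```python
import math

def erlang_c(n, a):
    """Probability that an arriving task must wait in an M/M/n queue
    with offered load a = lambda/mu (requires a < n)."""
    tail = a ** n / math.factorial(n) * n / (n - a)
    return tail / (sum(a ** k / math.factorial(k) for k in range(n)) + tail)

def npr_mean_waits(lams, mu, n):
    """Mean queuing delay per priority class (index 0 = highest) in an
    M/M/n/inf/inf/NPR queue with common service rate mu, via Cobham's
    formula: W_k = W0 / ((1 - s_{k-1}) * (1 - s_k)),
    where s_k = sum_{i<=k} lam_i / (n * mu).
    Illustrative sketch, not the paper's implementation."""
    total = sum(lams)
    assert total < n * mu, "unstable system: need total arrival rate < n*mu"
    w0 = erlang_c(n, total / mu) / (n * mu)  # residual delay seen on arrival
    waits, s_prev = [], 0.0
    for lam in lams:
        s_k = s_prev + lam / (n * mu)
        waits.append(w0 / ((1.0 - s_prev) * (1.0 - s_k)))
        s_prev = s_k
    return waits

# Two classes on 4 servers: the delay-sensitive class waits far less.
print(npr_mean_waits([2.0, 1.5], mu=1.0, n=4))  # ~[0.37, 2.95]
```

Note that the class means satisfy the usual conservation law: their load-weighted average equals the FCFS M/M/n waiting time, so prioritization reallocates delay from delay-sensitive tasks to insensitive ones rather than creating capacity.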
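The information-entropy term mentioned in the abstract corresponds to the standard maximum-entropy objective that SAC optimizes (Haarnoja et al.); the paper's exact formulation is not given in this record, so the canonical form is shown here for reference:

```latex
J(\pi)=\sum_{t}\mathbb{E}_{(s_t,a_t)\sim\rho_\pi}
\Bigl[\,r(s_t,a_t)+\alpha\,\mathcal{H}\bigl(\pi(\cdot\mid s_t)\bigr)\Bigr],
\qquad
\mathcal{H}\bigl(\pi(\cdot\mid s_t)\bigr)=-\mathbb{E}_{a\sim\pi}\bigl[\log\pi(a\mid s_t)\bigr]
```

Here the temperature α trades off expected reward against policy randomness; a larger α keeps the policy stochastic and improves exploration, which is the abstract's stated motivation for preferring SAC over deterministic-policy DRL.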
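Likewise, "optimizing the experience sampling scheme" points at prioritized experience replay. Below is a minimal proportional-prioritization sketch in the style of Schaul et al.; the class name, buffer layout, and hyperparameters are assumptions for illustration, not COURIER's implementation.

```python
import numpy as np

class PrioritizedReplay:
    """Proportional prioritized experience replay (Schaul et al. style):
    transitions with larger TD error are sampled more often, and
    importance-sampling (IS) weights correct the induced bias.
    Illustrative sketch only -- names and hyperparameters are assumed."""

    def __init__(self, capacity, alpha=0.6, beta=0.4, eps=1e-6):
        self.capacity, self.alpha, self.beta, self.eps = capacity, alpha, beta, eps
        self.data, self.prios = [], []

    def add(self, transition, td_error=1.0):
        if len(self.data) >= self.capacity:           # evict the oldest transition
            self.data.pop(0)
            self.prios.pop(0)
        self.data.append(transition)
        self.prios.append((abs(td_error) + self.eps) ** self.alpha)

    def sample(self, batch_size):
        p = np.asarray(self.prios)
        p = p / p.sum()                               # P(i) proportional to priority^alpha
        idx = np.random.choice(len(self.data), batch_size, p=p)
        w = (len(self.data) * p[idx]) ** -self.beta   # IS weights for bias correction
        return idx, [self.data[i] for i in idx], w / w.max()

    def update_priorities(self, idx, td_errors):
        for i, e in zip(idx, td_errors):              # refresh after each learning step
            self.prios[i] = (abs(e) + self.eps) ** self.alpha
```

Transitions with larger TD error are replayed more often, so informative experiences are consumed sooner and the networks converge faster, at the cost of a sampling bias that the IS weights correct.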
Authors
YANG Xiuwen, CUI Yunhe, QIAN Qing, GUO Chun, SHEN Guowei
(College of Computer Science and Technology, Guizhou University, Guiyang 550025, China; State Key Laboratory of Public Big Data, Guiyang 550025, China; Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guiyang 550025, China; School of Information, Guizhou University of Finance and Economics, Guiyang 550000, China)
Source
Computer Science (《计算机科学》)
Indexed in CSCD
Peking University Core Journal (北大核心)
2024, No. 5, pp. 293-305 (13 pages)
Funds
National Natural Science Foundation of China (62102111)
Guizhou Provincial Science and Technology Program ([2020]1Y267; Qiankehe Major Special Project [2024]003)
Natural Science Research Project of the Department of Education of Guizhou Province ([2021136])
Talent Introduction Research Project of Guizhou University ((2019)52).
Keywords
Edge computing
Computation offloading
Non-preemptive priority queuing
Information entropy
Deep reinforcement learning
Prioritized experience replay