摘要
为使5G混合专网结构的2种用户能获得最优计算卸载策略,将2种用户竞争移动边缘计算(mobile edge computing,MEC)服务器资源的问题建模成Stackelberg博弈,并分别讨论了完全信息博弈和不完全信息博弈下的策略。完全信息博弈下,存在唯一纳什均衡解;不完全信息博弈下,将环境建模为部分可观测的马尔可夫决策过程(partially observable Markov decision process,POMDP),并提出一种基于二阶段深度强化学习(two-stage deep reinforcement learning,TSDRL)的最优卸载策略。仿真实验表明:该算法相较于D-DRL算法能减少20.81%的时延及3.38%的能耗,有效提高用户QoE(quality of experience)。
To achieve the optimal computation offloading strategy for two kinds of MEC users in 5G hybrid private network, Stackelberg game is used to build the model of the competition for MEC server resources of two kinds of users, and the strategies of complete information game and partially incomplete information game are researched respectively. It is proved that there is only one Nash equilibrium solution in the complete information scenario. In the incomplete information scenario, the environment is modeled as POMDP, and a two-stage deep reinforcement learning(TSDRL) is proposed to obtain the optimal computation offloading strategy. Simulation results show the proposed algorithm having a total reduction of 20.81% time delay and 3.38 % energy consumption compared with the D-DRL algorithm and can effectively improve the user QoE(quality of experience).
作者
周娴玮
龚启旭
余松森
Zhou Xianwei;Gong Qixu;Yu Songsen(School of Software,South China Normal University,Foshan 528225,China)
出处
《系统仿真学报》
CAS
CSCD
北大核心
2023年第2期372-385,共14页
Journal of System Simulation
基金
广东省应用型科技研发重大专项(2016B020244003)
广东省基础与应用基础研究基金(2020B1515120089,2020A1515110783)
广东省企业科技特派员项目(GDKTP2020014000)。
关键词
5G混合专网
计算卸载
STACKELBERG博弈
NASH均衡
马尔可夫决策过程
5G hybrid private network
computation offloading
Stackelberg game theory
Nash equilibrium
partially observable Markov decision process(POMDP)