Dynamic Spectrum Access Based on Prior Knowledge Enabled Reinforcement Learning with Double Actions in Complex Electromagnetic Environment (Cited by: 2)

Abstract: This paper addresses the spectrum access problem of cognitive users in a fast-changing dynamic interference spectrum environment. Prior knowledge for dynamic spectrum access is modeled, and a reliability quantification scheme is presented to guide the use of that prior knowledge in the learning process. Furthermore, a spectrum access scheme based on prior knowledge enabled RL (PKRL) is designed, which effectively improves learning efficiency and helps users adapt to a fast-changing, high-density electromagnetic environment. Compared with existing methods, the proposed algorithm can adjust the access channel online according to historical information and obtains the optimal access policy more efficiently. Simulation results show that the convergence speed of learning improves by about 66% while the average throughput remains unchanged.
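Illustrative note: the abstract describes seeding reinforcement learning with prior knowledge that is weighted by a reliability measure. The sketch below is not the paper's PKRL algorithm, whose details are not given in this record. It is a minimal tabular Q-learning toy under stated assumptions: a hypothetical sweep jammer that moves one channel per slot, a hypothetical prior value table, and a scalar reliability weight in [0, 1] that scales the prior before learning starts. All names (`init_q`, `run`, `reliability`) are invented for illustration.

```python
import random


def init_q(prior, reliability):
    """Scale a prior value table by a reliability weight in [0, 1] (assumed scheme)."""
    return [[reliability * v for v in row] for row in prior]


def run(n_channels, prior, reliability, steps=500, alpha=0.5, gamma=0.5,
        epsilon=0.1, seed=0):
    """Tabular Q-learning for channel selection against a toy sweep jammer.

    State: index of the channel jammed in the current slot.
    Action: channel chosen for the next slot.
    Reward: 1 if the chosen channel is not jammed in the next slot, else 0.
    """
    rng = random.Random(seed)
    q = init_q(prior, reliability)
    state = 0
    total = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(n_channels)  # explore
        else:
            # exploit; ties resolve to the lowest channel index
            action = max(range(n_channels), key=lambda a: q[state][a])
        next_state = (state + 1) % n_channels  # assumed sweep pattern
        reward = 0 if action == next_state else 1
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        total += reward
        state = next_state
    return q, total


if __name__ == "__main__":
    n = 5
    # Hypothetical prior: value 0 for the channel expected to be jammed next.
    prior = [[0.0 if a == (s + 1) % n else 1.0 for a in range(n)]
             for s in range(n)]
    _, with_prior = run(n, prior, reliability=1.0, steps=200, epsilon=0.0)
    _, cold_start = run(n, prior, reliability=0.0, steps=200, epsilon=0.1)
    print("successful slots with prior:", with_prior)
    print("successful slots from cold start:", cold_start)
```

In this toy setting a correct, fully trusted prior lets the greedy policy avoid the jammed channel from the first slot, while a reliability of 0 discards the prior and reduces to ordinary cold-start Q-learning. How the paper actually quantifies reliability and combines it with learning is not specified here.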

Authors: Linghui Zeng, Fuqiang Yao, Jianzhao Zhang, Min Jia

Affiliations: The Sixty-third Research Institute Communication Research Center

Source: China Communications (中国通信（英文版）), SCIE, CSCD, 2022, No. 7, pp. 13-24 (12 pages)

Funding: Supported by the National Natural Science Foundation of China (No. 62131005)

Keywords: prior knowledge; reinforcement learning; anti-jamming communication; spectrum access

Classification number: TN929.5 [Electronics and Telecommunications: Communication and Information Systems]

