LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle 被引量：1

下载PDF

导出

摘要 To solve the path following control problem for unmanned surface vehicles(USVs),a control method based on deep reinforcement learning(DRL)with long short-term memory(LSTM)networks is proposed.A distributed proximal policy opti-mization(DPPO)algorithm,which is a modified actor-critic-based type of reinforcement learning algorithm,is adapted to improve the controller performance in repeated trials.The LSTM network structure is introduced to solve the strong temporal cor-relation USV control problem.In addition,a specially designed path dataset,including straight and curved paths,is established to simulate various sailing scenarios so that the reinforcement learning controller can obtain as much handling experience as possible.Extensive numerical simulation results demonstrate that the proposed method has better control performance under missions involving complex maneuvers than trained with limited scenarios and can potentially be applied in practice.

作者 XIA Jiawei ZHU Xufang LIU Zhong XIA Qingtao

机构地区 School of Weaponry Engineering Qingdao Campus School of Electronic Engineering

出处《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2023年第5期1343-1358,共16页 系统工程与电子技术（英文版）

基金 supported by the National Natural Science Foundation(61601491) the Natural Science Foundation of Hubei Province(2018CFC865) the China Postdoctoral Science Foundation Funded Project(2016T45686).

关键词 unmanned surface vehicle(USV) deep reinforce-ment learning(DRL) path following path dataset proximal po-licy optimization long short-term memory(LSTM)

分类号 U664.82 [交通运输工程—船舶及航道工程] U665.26 [交通运输工程—船舶与海洋工程] TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

同被引文献8

1樊琼剑,杨忠,方挺,沈春林.多无人机协同编队飞行控制的研究现状[J].航空学报,2009,30(4):683-691. 被引量：103
2朱杰斌,秦世引.无人机编队飞行的分布式控制策略与控制器设计[J].智能系统学报,2010,5(5):392-399. 被引量：10
3FU Dong,LI Xiangjun,MOU Weihua,MA Ming,OU Gang.Navigation jamming signal recognition based on long short-term memory neural networks[J].Journal of Systems Engineering and Electronics,2022,33(4):835-844. 被引量：3
4ZENG Yuan,LU Wenbin,YU Bo,TAO Shifei,ZHOU Haosu,CHEN Yu.Improved IMM algorithm based on support vector regression for UAV tracking[J].Journal of Systems Engineering and Electronics,2022,33(4):867-876. 被引量：3
5GAN Yuanying,LIU Chuntong,LI Hongcai,LIU Zhongye.A camouflage target detection method based on local minimum difference constraints[J].Journal of Systems Engineering and Electronics,2023,34(3):696-705. 被引量：1
6YAO Qihai,WANG Yong,YANG Yixin.Range estimation of few-shot underwater sound source in shallow water based on transfer learning and residual CNN[J].Journal of Systems Engineering and Electronics,2023,34(4):839-850. 被引量：3
7YU Chaopeng,XIONG Wei,LI Xiaoqing,DONG Lei.Deep convolutional neural network for meteorology target detection in airborne weather radar images[J].Journal of Systems Engineering and Electronics,2023,34(5):1147-1157. 被引量：2
8QIN Boyu,ZHANG Dong,TANG Shuo,XU Yang.Two-layer formation-containment fault-tolerant control of fixed-wing UAV swarm for dynamic target tracking[J].Journal of Systems Engineering and Electronics,2023,34(6):1375-1396. 被引量：2

引证文献1

1CHEN Jinyang,WANG Xuhua,CHEN Xian.Track correlation algorithm based on CNN-LSTM for swarm targets[J].Journal of Systems Engineering and Electronics,2024,35(2):417-429.

Journal of Systems Engineering and Electronics

2023年第5期

浏览历史

内容加载中请稍等...

LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle 被引量：1

同被引文献8

引证文献1

相关作者

相关机构

相关主题

浏览历史