Reinforcement learning based parameter optimization of active disturbance rejection control for autonomous underwater vehicle 被引量：2

下载PDF

导出

摘要 This paper proposes a liner active disturbance rejection control(LADRC) method based on the Q-Learning algorithm of reinforcement learning(RL) to control the six-degree-of-freedom motion of an autonomous underwater vehicle(AUV).The number of controllers is increased to realize AUV motion decoupling.At the same time, in order to avoid the oversize of the algorithm, combined with the controlled content, a simplified Q-learning algorithm is constructed to realize the parameter adaptation of the LADRC controller.Finally, through the simulation experiment of the controller with fixed parameters and the controller based on the Q-learning algorithm, the rationality of the simplified algorithm, the effectiveness of parameter adaptation, and the unique advantages of the LADRC controller are verified.

作者 SONG Wanping CHEN Zengqiang SUN Mingwei SUN Qinglin

机构地区 College of Artificial Intelligence Key Laboratory of Intelligent Robotics of Tianjin

出处《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第1期170-179,共10页 系统工程与电子技术（英文版）

基金 supported by the National Natural Science Foundation of China (61973175 61973172) Tianjin Natural Science Foundation (19JCZDJC32800)。

关键词 autonomous underwater vehicle(AUV) reinforcement learning(RL) Q-LEARNING linear active disturbance rejection control(LADRC) motion decoupling parameter optimization

分类号 TP242 [自动化与计算机技术—检测技术与自动化装置] TP273 [自动化与计算机技术—控制科学与工程]

引文网络
相关文献

参考文献5

1韩京清.自抗扰控制器及其应用[J].控制与决策,1998,13(1):19-23. 被引量：1007
2LIU Junjie,SUN Mingwei,CHEN Zengqiang,SUN Qinglin.High AOA decoupling control for aircraft based on ADRC[J].Journal of Systems Engineering and Electronics,2020,31(2):393-402. 被引量：8
3LI Yue,QIU Xiaohui,LIU Xiaodong,XIA Qunli.Deep reinforcement learning and its application in autonomous fitting optimization for attack areas of UCAVs[J].Journal of Systems Engineering and Electronics,2020,31(4):734-742. 被引量：12
4Ruiguang Yang Mingwei Sun Zengqiang Chen.Active disturbance rejection control on first-order plant[J].Journal of Systems Engineering and Electronics,2011,22(1):95-102. 被引量：21
5唐德翠,高志强,张绪红.浊度大时滞过程的预测自抗扰控制器设计[J].控制理论与应用,2017,34(1):101-108. 被引量：25

二级参考文献29

1韩京清,王伟.非线性跟踪─微分器[J].系统科学与数学,1994,14(2):177-183. 被引量：410
2韩京清.一类不确定对象的扩张状态观测器[J].控制与决策,1995,10(1):85-88. 被引量：421
3韩京清.非线性状态误差反馈控制律──NLSEF[J].控制与决策,1995,10(3):221-225. 被引量：179
4K.J.Astrom,T.Hagglund.PID controllers:theory,design,and tuning.Research Triangle Park,NC:Instrument Society of America,1995. 被引量：1
5J.Q.Han.From PID to active disturbance rejection control.IEEE Trans.on Industrial Electronics,2009,56(3):1-7. 被引量：1
6Z.Gao,Y.Huang,J.Q.Han.An alternative paradigm for control system design.Proc.of the 40th IEEE Conference on Decision and Control,2001:4578-4585. 被引量：1
7Z.Gao.Active disturbance rejection control:a paradigm shift in feedback control system design.Proc.of the American Control Conference,2006:2399-2405. 被引量：1
8Y.Huang,K.K.Xu,J.Q.Han,et al.Flight control design using extended state observer and non-smooth feedback.Proc.of the 40th IEEE Conference on Decision and Control,2001:223-228. 被引量：1
9Z.Gao,S.Hu,F.Jiang.A novel motion control design approach based on active disturbance rejection.Proc.of the 40th IEEE Conference on Decision and Control,2001:4877-4882. 被引量：1
10Y.Hou,Z.Gao,F.Jiang,et al.Active disturbance rejection control for web tension regulation.Proc.of the 40th IEEE Conference on Decision and Control,2001:4974-4979. 被引量：1

共引文献1048

1程婷,曾喆昭,杜肖丰,王风琴,程启芝.大时滞过程的预测自耦PI控制方法[J].信息与控制,2019,48(6):672-677. 被引量：3
2高阳,吴文海,张杨.非对称输入饱和下的非仿射不确定系统自抗扰反演控制[J].控制与决策,2020,35(4):885-892. 被引量：5
3高阳,吴文海,高丽.高阶不确定非线性系统的线性自抗扰控制[J].控制与决策,2020,35(2):483-491. 被引量：12
4芮宏斌,曹伟,王天赐.基于改进型自抗扰控制的光伏板清洁机器人路径跟踪控制研究[J].机械设计,2021,38(S02):79-83. 被引量：11
5马悦萌,刘明,杨丁,杨明,张敏刚,葛亚杰.热约束下近空间飞行器预设性能抗噪控制[J].航空学报,2023,44(S02):28-40.
6黄桂梅,刘永立.基于自抗扰技术的风扇磨直吹锅炉主汽压力控制系统的研究[J].仪器仪表用户,2006,13(5):23-24. 被引量：1
7张兆靖,杨慧中,姜永森.一种单神经元自抗扰控制器[J].东南大学学报（自然科学版）,2006,36(S1):132-134. 被引量：4
8黄焕袍,武利强,高峰,韩京清,林永君.自抗扰控制技术在大型火电厂的应用[J].吉林大学学报（工学版）,2004,34(z1):89-94. 被引量：1
9武利强,韩京清.TD滤波器及其应用[J].计算技术与自动化,2003(z1):61-63. 被引量：4
10刘翔,姜学智,李东海,万静芳,薛亚丽.火电单元机组机炉协调自抗扰控制[J].控制理论与应用,2001,18(z1):149-152. 被引量：15

同被引文献19

1王日中,李慧平,崔迪,徐德民.基于深度强化学习算法的自主式水下航行器深度控制[J].智能科学与技术学报,2020(4):354-360. 被引量：4
2孙斌,常晓明,段晋军.基于四元数的机械臂平滑姿态规划与仿真[J].机械科学与技术,2015,34(1):56-59. 被引量：15
3吴琪,李晔.基于四元数的欠驱动AUV的镇定控制设计[J].智能系统学报,2014,9(2):186-191. 被引量：5
4代俣西,沈秋,王惠南,孔繁锵.双目视觉中基于对偶四元数的三维运动估计[J].光学技术,2015,41(2):132-137. 被引量：4
5杨欢,熊剑,郭杭,衷卫声.基于Cortex-M4的多传感器组合导航系统设计[J].传感器与微系统,2017,36(2):88-90. 被引量：7
6葛为民,宇旭东,王肖锋,邢恩宏.基于对偶四元数的机械臂运动学建模及分析[J].机械传动,2018,42(7):112-117. 被引量：6
7LIU Haibo,WANG Heping,SUN Junlei.Attitude control for QTR using exponential nonsingular terminal sliding mode control[J].Journal of Systems Engineering and Electronics,2019,30(1):191-200. 被引量：5
8冯金富,胡俊华,齐铎.水空跨介质航行器发展需求及其关键技术[J].空军工程大学学报（自然科学版）,2019,20(3):8-13. 被引量：23
9宋锡文,董业鹏,杨世飞.基于FPGA的振动信号处理参数寻优试验研究[J].电子测量与仪器学报,2021,35(2):101-108. 被引量：14
10唐胜景,张宝超,岳彩红,桑晨,郭杰.跨介质飞行器关键技术及飞行动力学研究趋势分析[J].飞航导弹,2021(6):7-13. 被引量：7

引证文献2

1许一航,刘剑,孙长银.基于四元数的全驱动碟形AUV单矢量反馈控制[J].智能科学与技术学报,2022,4(4):513-521.
2李存健,刘福朝,刘宁,赵辉,周浩.跨域飞行器惯导装置设计及抗冲击测量方法[J].电子测量技术,2024,47(5):167-172.

1Hailiang Xu,Fei Nie,Zhongxing Wang,Shinan Wang,Jiabing Hu.Impedance Modeling and Stability Factor Assessment of Grid-connected Converters Based on Linear Active Disturbance Rejection Control[J].Journal of Modern Power Systems and Clean Energy,2021,9(6):1327-1338. 被引量：2
2曾柏森,钟勇,牛宪华.基于因子分解机用于安全探索的Q表初始化方法[J].计算机应用,2022,42(1):209-214.
3桓琦,谢小权,郭敏,曾颖明.针对深度强化学习导航的物理对抗攻击方法[J].信息安全研究,2022,8(3):212-222.
4Junjie Wang,Qichao Zhang,Dongbin Zhao.Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning[J].IEEE/CAA Journal of Automatica Sinica,2022,9(3):567-569. 被引量：2
5Binghui Li,Badong Chen.An Adaptive Rapidly-Exploring Random Tree[J].IEEE/CAA Journal of Automatica Sinica,2022,9(2):283-294. 被引量：15
6WANG Xinhua,ZHU Yungang,LI Dayu,ZHANG Guang.Underwater Target Detection Based on Reinforcement Learning and Ant Colony Optimization[J].Journal of Ocean University of China,2022,21(2):323-330. 被引量：1
7Ling Wang,Zixiao Pan,Jingjing Wang.A Review of Reinforcement Learning Based Intelligent Optimization for Manufacturing Scheduling[J].Complex System Modeling and Simulation,2021,1(4):257-270. 被引量：14
8杨丰萍,程权,周铭栀,张殷.基于改进自抗扰控制的微电网混合储能控制策略[J].科学技术与工程,2022,22(5):1898-1905. 被引量：13
9Bai Shuangxia,Song Shaomei,Liang Shiyang,Wang Jianmei,Li Bo,Neretin Evgeny.UAV Maneuvering Decision-Making Algorithm Based on Twin Delayed Deep Deterministic Policy Gradient Algorithm[J].Journal of Artificial Intelligence and Technology,2022,2(1):16-22. 被引量：7
10Young Kook Kim,Woojin Seol,Jihyuk Park.Biomimetic Quadruped Robot with a Spinal Joint and Optimal Spinal Motion via Reinforcement Learning[J].Journal of Bionic Engineering,2021,18(6):1280-1290. 被引量：3

Journal of Systems Engineering and Electronics

2022年第1期

浏览历史

内容加载中请稍等...