动态规划最优控制在非线性系统中的应用被引量：1

Adaptive Dynamic Optimal Control Based on a Novel Adaptive Law for the Nonlinear CT Systems

下载PDF

导出

摘要应用一种新的自适应动态最优化方法(ADP),在线实现对非线性连续系统的最优控制。首先应用汉密尔顿函数(Hamilton-Jacobi-Bellman,HJB)求解系统的最优控制,并应用神经网络BP算法对汉密尔顿函数中的性能指标进行估计,进而得到非线性连续系统的最优控制。同时引进一种新的自适应算法,基于参数误差,在线实现对系统进行动态最优求解,而且通过李亚普诺夫方法对参数收敛情况也进行详细的分析。最后,用仿真结果来验证所提出的方法的可行性。 This paper proposed a novel adaptive dynamic program （ADP） algorithm to online obtain the optimal control for nonlinear systems. Firstly, the optimal control of system was obtained by using HJB equation, and an NN algorithm was used to approximate the value function of HJB. In particular, this paper introduced a novel adaptive algorithm to online update critic NN weights based on the parameter error estimation. The closed-loop system stability was proved based on the Lyapunov theory. Finally, the simulation results were provided to verify the effectiveness of the proposed methods.

作者陈瑶张刚

机构地区云南省交通科学研究所昆明理工大学机电工程学院

出处《计算技术与自动化》 2015年第4期15-18,共4页 Computing Technology and Automation

关键词最优控制动态规划神经网络自适应算法汉密尔顿函数 optimal control Approximate Dynamic Program（ADP） Neural Network（NN） adaptive law Hamilton-Ja cobi-Bellman（HJB）

分类号 TP273.1 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献15

1B. R. E, Dynamic programming, Princeton: Princeton Uni versity Press, 1957. 被引量：1
2SUTTON R S,BARTO A G. Reinforcement learning: an introduction. Cambridge Univ Press, 1998. 被引量：1
3WERBOS P J. Approximate dynamic programming for real-- time control and neural modeling, Handbook of intelligent control: Neural[J]. fuzzy, and adaptive approaches, 1992, 15: 493--525. 被引量：1
4DREYFUS S E,LAW A M. Art and theory of dynamic pro- gramming[M]. New York: Academic Press, 1977,56. 被引量：1
5MURRAY J J,COX C J,LENDARIS G G, et al. Adaptive dynamic programming, Systems, Man, and Cybernetics, Part C= Applications and Reviews[J]. IEEE Transactions on, 2002, 32(2): 140-153. 被引量：1
6WERBOS P J. A menu of designs for reinforcement learning over time[J]. Neural networks for control, 1990 : 67-95. 被引量：1
7ABU-KHALAF M,LEWIS F L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach[J]. Automatiea, 2005, 41(5) : 779-- 791. 被引量：1
8NA J,HERRMANN G,REN X. , et al. Robust adaptive fi- nite-time parameter estimation and control of nonlinear sys- tems[J]. IEEE International Symposium on in Intelligent Control (ISIC), 2011 : 1014-- I019. 被引量：1
9Na. Jing, Ren. Xuemei, Zhang. Dongdong, Adaptive con- trol for nonlinear pure-feedback systems with high-order slid- ing mode observer[J]. IEEE transactions on neural networks and learning systems, 2013, 24(3)= 370--382. 被引量：1
10VAMVOUDAKIS K G,LEWIS F L. Online actor-critic algo rithm to solve the continuous-time infinite horizon optimal control problem[J]. Automation, 2010,46 (5) : 878- 888. 被引量：1

同被引文献2

1任雯,闻霞,王维庆.基于线性二次型的单神经元PID最优控制器设计及仿真[J].计算机应用与软件,2008,25(5):123-124. 被引量：8
2张化光,张欣,罗艳红,杨珺.自适应动态规划综述[J].自动化学报,2013,39(4):303-311. 被引量：77

引证文献1

1姚庆华,和永军,郭镇江,伏冬孝.基于梯度算法的跟踪最优控制器设计及仿真[J].计算机与现代化,2016(12):34-37.

1欧阳小迅,沈轶.基于最优控制理论的选举模型分析[J].华中科技大学学报（自然科学版）,2007,35(1):58-59. 被引量：1
2徐跃州,张欣,张涛,李阳.基于贪婪—改进果蝇算法的无线传感器网络路由协议[J].传感器与微系统,2015,34(5):134-136. 被引量：3
3黄学雨,何焕,戴志晃.基于Min-Min和蚁群算法的网格任务调度方法[J].计算机时代,2009(7):50-52.
4刘若峰,曹大铸.一类非线性连续系统参数估计方法[J].自动化学报,1990,16(5):460-464. 被引量：4
5尹作友,张化光.基于模糊模型的非线性时滞系统的H∞鲁棒容错控制[J].系统仿真学报,2008,20(24):6783-6786. 被引量：1
6韦鲜,刘华毅.基于改进遗传算法的线性离散系统辨识[J].自动化技术与应用,2003,22(1):8-10. 被引量：1
7张化光,张欣,罗艳红,杨珺.自适应动态规划综述[J].自动化学报,2013,39(4):303-311. 被引量：77
8解学军,张大雷.基于径向基函数网络或模糊系统的非线性连续时间系统的自适应调节[J].系统工程理论与实践,2000,20(2):8-14. 被引量：2
9张灿青,彭成,薛智山,满君丰.基于粒子群算法的构件组合技术研究[J].计算技术与自动化,2017,36(1):78-81.
10刘延年,忻欣,冯纯伯.基于神经网络的一类非线性连续系统的稳定自适应控制[J].控制理论与应用,1996,13(1):70-75. 被引量：9

计算技术与自动化

2015年第4期

浏览历史

内容加载中请稍等...

动态规划最优控制在非线性系统中的应用被引量：1

参考文献15

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

动态规划最优控制在非线性系统中的应用 被引量：1

参考文献15

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

动态规划最优控制在非线性系统中的应用被引量：1