
Application of Dynamic Programming Optimal Control to Nonlinear Systems (cited by: 1)

Adaptive Dynamic Optimal Control Based on a Novel Adaptive Law for the Nonlinear CT Systems
Abstract: This paper applies a novel adaptive dynamic programming (ADP) method to obtain, online, the optimal control of nonlinear continuous-time (CT) systems. First, the optimal control is derived from the Hamilton-Jacobi-Bellman (HJB) equation, and a neural network (NN) trained with the back-propagation (BP) algorithm approximates the performance index (value function) in the HJB equation, which yields the optimal control of the nonlinear CT system. A novel adaptive law driven by the parameter estimation error is then introduced to update the critic NN weights online. Parameter convergence and closed-loop stability are analyzed using Lyapunov theory. Finally, simulation results verify the feasibility of the proposed method.
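For orientation, the quantities named in the abstract can be written out in the form commonly used in the ADP literature. This is only a sketch under stated assumptions: the abstract does not give the system model or cost, so the input-affine dynamics, the symbols f, g, Q, R, the basis vector φ, and the residual ε below are assumed notation, not necessarily the authors' own, and the paper's specific adaptive law is not reproduced here.

```latex
% Assumed setup (not stated in the abstract): input-affine dynamics
% and an infinite-horizon cost with a quadratic control penalty.
\dot{x} = f(x) + g(x)\,u, \qquad
V(x(t)) = \int_{t}^{\infty} \Big( Q(x(\tau)) + u(\tau)^{\top} R\, u(\tau) \Big)\, d\tau

% HJB equation for the optimal value function V^*, and the control
% obtained by setting the derivative of the bracket w.r.t. u to zero.
0 = \min_{u} \Big[ Q(x) + u^{\top} R\, u
      + \nabla V^{*}(x)^{\top} \big( f(x) + g(x)\,u \big) \Big],
\qquad
u^{*}(x) = -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V^{*}(x)

% Critic NN: the value function is approximated on a basis \phi(x)
% with ideal weights W; \hat{W} is the online estimate of W, and
% \nabla\phi(x) denotes the Jacobian of \phi with respect to x.
V^{*}(x) = W^{\top} \phi(x) + \varepsilon(x),
\qquad
\hat{u}(x) = -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla \phi(x)^{\top} \hat{W}
```

In this notation, the adaptive law described in the abstract would be the update rule for the weight estimate \hat{W}, driven by the parameter estimation error rather than by the HJB residual alone.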
Authors: 陈瑶 (Chen Yao), 张刚 (Zhang Gang)
Source: Computing Technology and Automation (《计算技术与自动化》), 2015, No. 4, pp. 15-18 (4 pages)
Keywords: optimal control; approximate dynamic programming (ADP); neural network (NN); adaptive law; Hamilton-Jacobi-Bellman (HJB)

