期刊文献+

Online optimal control of nonlinear discrete-time systems using approximate dynamic programming 被引量:4

Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
原文传递
导出
摘要 In this paper,the optimal control of a class of general affine nonlinear discrete-time(DT) systems is undertaken by solving the Hamilton Jacobi-Bellman(HJB) equation online and forward in time.The proposed approach,referred normally as adaptive or approximate dynamic programming(ADP),uses online approximators(OLAs) to solve the infinite horizon optimal regulation and tracking control problems for affine nonlinear DT systems in the presence of unknown internal dynamics.Both the regulation and tracking controllers are designed using OLAs to obtain the optimal feedback control signal and its associated cost function.Additionally,the tracking controller design entails a feedforward portion that is derived and approximated using an additional OLA for steady state conditions.Novel update laws for tuning the unknown parameters of the OLAs online are derived.Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signals approach the optimal control inputs with small bounded error.In the absence of OLA reconstruction errors,an optimal control is demonstrated.Simulation results verify that all OLA parameter estimates remain bounded,and the proposed OLA-based optimal control scheme tunes itself to reduce the cost HJB equation. In this paper,the optimal control of a class of general affine nonlinear discrete-time(DT) systems is undertaken by solving the Hamilton Jacobi-Bellman(HJB) equation online and forward in time.The proposed approach,referred normally as adaptive or approximate dynamic programming(ADP),uses online approximators(OLAs) to solve the infinite horizon optimal regulation and tracking control problems for affine nonlinear DT systems in the presence of unknown internal dynamics.Both the regulation and tracking controllers are designed using OLAs to obtain the optimal feedback control signal and its associated cost function.Additionally,the tracking controller design entails a feedforward portion that is derived and approximated using an additional OLA for steady state conditions.Novel update laws for tuning the unknown parameters of the OLAs online are derived.Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signals approach the optimal control inputs with small bounded error.In the absence of OLA reconstruction errors,an optimal control is demonstrated.Simulation results verify that all OLA parameter estimates remain bounded,and the proposed OLA-based optimal control scheme tunes itself to reduce the cost HJB equation.
出处 《控制理论与应用(英文版)》 EI 2011年第3期361-369,共9页
基金 partly supported by the National Science Foundation (No.ECCS#0621924,ECCS-#0901562) the Intelligent Systems Center
关键词 Online nonlinear optimal control Hamilton Jacobi-Bellman Online approximators Discrete-time systems Online nonlinear optimal control Hamilton Jacobi-Bellman Online approximators Discrete-time systems
  • 相关文献

参考文献15

  • 1J. Shamma,J. Cloutier.Existence of SDRE stabilizing feedback. IEEE Transactions on Automatic Control . 2003 被引量:1
  • 2D. Vrabie,O. Pastravanu,M. Abu-Khalaf,et al.Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica . 2009 被引量:1
  • 3H. Zhang,Q. Wei,Y. Luo.A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Transactions on Systems Man and Cybernetics . 2008 被引量:1
  • 4T. Dierks,B. T. Thumati,S. Jagannathan.Optimal control of unknown affine nonlinear discrete-time systems using offiine-trained neural networks with proof of convergence. Neural Networks . 2009 被引量:1
  • 5K. G. Vamvoudakis,F. L. Lewis.Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica . 2010 被引量:1
  • 6Khalil HK.Nonlinear Systems. . 2002 被引量:1
  • 7Krstic M,Kanellakopoulos I,Kokotovic P.Nonlinear and adaptive control design. . 1995 被引量:1
  • 8Chen Z,Jagannathan S.Generalized Hamilton-Jacobi- Bellman formulation-based neural network control of a-ne nonlinear discretetime systems. IEEE Transactions on Neu- ral Networks . 2008 被引量:1
  • 9TAMIMI A A,LEWIS F L,ABU-KHALAF M.Discrete-time non-linear HJB solution using approximate dynamic programming:con-vergence proof. IEEE Transactions on Systems,Man, and Cyber-netics,part B:Cybernetics . 2008 被引量:1
  • 10Jagannathan,S.Neural Network Control of Nonlinear Discrete- Time Systems. . 2006 被引量:1

同被引文献4

引证文献4

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部