Research on an Adaptive Control Method Based on Recurrent Neural Networks (一种基于递归神经网络的自适应控制方法研究). Cited by: 3

A Novel Adaptive Control Algorithm Based on Recurrent Neural Network

Abstract: This paper addresses the control of complex systems that are fast, multivariable, and strongly nonlinear, and proposes a new adaptive control method built on reinforcement learning. The method requires no prior knowledge of the system. It is based on a recurrent neural network combined with the self-tuning capability of reinforcement learning, and it controls an unstable nonlinear system effectively through online learning of its own network. A single inverted pendulum serves as the experimental plant. Simulation results show that the proposed method achieves very good control performance and steady-state accuracy, together with strong disturbance rejection.
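The closed loop described in the abstract (a recurrent network acting on an inverted pendulum and receiving a reinforcement signal) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the physical constants and failure thresholds are those of the classic cart-pole benchmark, the controller is a plain Elman network with random, untrained weights (not the OIF Elman variant named in the keywords), and no learning rule is shown because the abstract does not specify one.

```python
import math
import random


def cartpole_step(state, force, dt=0.02):
    """One Euler step of the frictionless cart-pole equations.

    Constants are the classic benchmark values (an assumption, not the paper's).
    """
    g, m_cart, m_pole, half_len = 9.8, 1.0, 0.1, 0.5
    x, x_dot, theta, theta_dot = state
    total_mass = m_cart + m_pole
    sin_t, cos_t = math.sin(theta), math.cos(theta)
    temp = (force + m_pole * half_len * theta_dot ** 2 * sin_t) / total_mass
    theta_acc = (g * sin_t - cos_t * temp) / (
        half_len * (4.0 / 3.0 - m_pole * cos_t ** 2 / total_mass))
    x_acc = temp - m_pole * half_len * theta_acc * cos_t / total_mass
    return (x + dt * x_dot, x_dot + dt * x_acc,
            theta + dt * theta_dot, theta_dot + dt * theta_acc)


def failed(state, x_limit=2.4, theta_limit=math.radians(12.0)):
    """Reinforcement signal: True when the cart or the pole leaves the allowed range."""
    x, _, theta, _ = state
    return abs(x) > x_limit or abs(theta) > theta_limit


class ElmanNet:
    """Plain Elman network: the hidden layer is fed back through context units."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)

        def w():
            return rng.uniform(-0.5, 0.5)

        self.w_in = [[w() for _ in range(n_in)] for _ in range(n_hidden)]
        self.w_ctx = [[w() for _ in range(n_hidden)] for _ in range(n_hidden)]
        self.w_out = [w() for _ in range(n_hidden)]
        self.context = [0.0] * n_hidden  # copy of the previous hidden activations

    def step(self, inputs):
        hidden = []
        for j in range(len(self.context)):
            s = sum(wi * xi for wi, xi in zip(self.w_in[j], inputs))
            s += sum(wc * c for wc, c in zip(self.w_ctx[j], self.context))
            hidden.append(math.tanh(s))
        self.context = hidden  # becomes the recurrent input at the next step
        return math.tanh(sum(wo * h for wo, h in zip(self.w_out, hidden)))


def run_episode(net, max_steps=200, max_force=10.0):
    """Run the untrained controller until failure; return the number of steps survived."""
    state = (0.0, 0.0, 0.05, 0.0)  # pole starts slightly off vertical
    for t in range(max_steps):
        force = max_force * net.step(state)
        state = cartpole_step(state, force)
        if failed(state):
            return t + 1
    return max_steps


if __name__ == "__main__":
    print("steps before failure:", run_episode(ElmanNet(n_in=4, n_hidden=6)))
```

An online learning scheme of the kind the abstract mentions would adjust the network weights whenever `failed` fires; with the random weights used here the pole is expected to fall quickly, which is the baseline such learning would improve on.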

Authors: Qian Zheng (钱征), Sun Liang (孙亮), Ruan Xiaogang (阮晓钢)

Affiliation: Institute of Information and Control, Beijing University of Technology (北京工业大学信息与控制研究所)

Source: 《微计算机信息》 (Control & Automation), Peking University Core Journal, 2005, No. 11S, pp. 88-90 (3 pages)

Funding: National Natural Science Foundation of China (Grant No. 60375017)

Keywords: reinforcement learning; OIF Elman network; BP network; single inverted pendulum

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]
