随机线性二次问题中一类改进的强化学习方法

下载PDF

导出

摘要随机线性二次问题是一类重要且研究较为成熟的随机控制问题。其中,部分信息条件下的随机线性二次问题是指系统的状态方程或代价函数中存在未知系数的情形,该文在前人工作的基础上,改进部分信息条件下线性二次问题的最优控制在线强化学习算法。所研究系统方程和代价函数的系数都存在未知量,在此条件下,算法通过可观察的样本轨迹和回报函数求得最优控制以及代价函数中的未知系数,进一步地,我们给出迭代过程收敛性与控制稳定性的证明。 Random linear quadratic problems are important and mature stochastic control problems.Among them,the stochastic linear quadratic problem under partial information conditions refers to the situation where there are unknown coefficients in the state equation or cost function of the system.Based on previous work,this paper improves the optimal control online reinforcement learning algorithm for linear quadratic problems under partial information conditions.The coefficients of the studied system equations and cost function have unknown quantities.In this condition,the algorithm obtains the optimal control and the unknown coefficients in the cost function through the observable sample trajectory and the reward function.At the same time,the convergence and stability of the iterative process are proved.

作者高晋鹏

机构地区山东大学

出处《科技创新与应用》 2024年第32期142-145,共4页 Technology Innovation and Application

关键词随机线性二次问题部分信息李雅普诺夫方程强化学习动态规划原理 random linear quadratic problem partial information Lyapunov equation reinforcement learning dynamic programming principle

分类号 O211.63 [理学—概率论与数理统计]

引文网络
相关文献

1张越,郭中阳,黄孝慈,邢孟阳,宋娟娟.基于改进超螺旋滑模控制的线控制动系统控制方法[J].汽车技术,2024(9):18-24.
2本刊编辑部.医学论著“材料与方法”的撰写要求[J].右江医学,2024,52(9):860-860.
3赵敬.高职院校航空电子专业课程职业化改革方式探析[J].信息与电脑,2024,36(15):149-153.
4屈路安,段超颖.基于CNN的单级光伏并网逆变器积分滑模控制方法[J].流体测量与控制,2024,5(5):26-30.
5孟亭亭,王久斌,崔爔.具有时变输出约束的柔性机翼非线性自适应控制[J].控制与决策,2024,39(9):2987-2994. 被引量：1
6崔智昊,王立国,韦鑫,王宗杰.考虑采样失准的卡尔曼滤波改进神经网络锌溴液流电池状态估计策略[J].中国电机工程学报,2024,44(20):8126-8135.
7胡玉洲,李莉,王发良.不对称信息条件下BOPS全渠道供应链定价和服务策略[J].科技与经济,2024,37(5):26-30.
8胡瑜洪,王德光,杨明,王玺.基于强化学习的离散事件系统最优定向监控[J].电子学报,2024,52(9):3172-3184.
9戴志辉,柳梅元,韦舒清,朱卫平,王文卓.基于超导磁储能的光伏场站送出线路距离保护[J].中国电力,2024,57(10):102-114.
10王念欣,曾晖,栾吉益,陈万福,王成镇.转炉氧枪动态控制技术研究与应用[J].山东冶金,2024,46(5):52-53.

科技创新与应用

2024年第32期

浏览历史

内容加载中请稍等...

随机线性二次问题中一类改进的强化学习方法

相关作者

相关机构

相关主题

浏览历史