摘要
针对移动机器人在未知的特殊环境(如U型、狭窄且不规则通道等)下路径规划效率低问题,本文提出一种强化学习(RL)驱动快速探索随机树(RRT)的局部路径重规划方法(RL-RRT).该方法利用Sarsa(λ)优化RRT的随机树扩展过程,既保持未知环境中RRT的随机探索性,又利用Sarsa(λ)缩减无效区域的探索代价.具体来说,在满足移动机器人运动学模型约束的同时,通过设定扩展节点的回报函数、目标距离函数和平滑度目标函数,缩减无效节点,加速探索过程,从而达到路径规划多目标决策优化的目标.仿真实验中,将本方法用于多种未知的特殊环境,实验结果显示出RL-RRT算法的可行性、有效性及其性能优势.
In this paper,a local path re-planning rapidly-exploring random tree(RRT)method(RL-RRT)driven by reinforcement learning(RL)is proposed,aiming at the low efficiency of path planning for the mobile robot in the unknown and peculiar environments such as U-shaped,narrow and irregular channels.The RRT random tree expansion process is optimized by Sarsa(λ)in this method,which not only maintains the random exploratory nature of RRT in the unknown environment,but also uses Sarsa(λ)to reduce the exploration cost of the invalid region.Specifically,RL-RRT can reduce invalid nodes and accelerate the exploration process by setting the return function,target distance function and smoothness objective function of extended nodes,while satisfying the constraints of mobile robot kinematics model,so as to achieve the goal of multi-objective decision-making optimization of path planning.In the simulation experiment,RL-RRT is applied to many unknown and particular environments.The experimental results show the feasibility,effectiveness and performance advantages of RL-RRT method.
作者
邹启杰
刘世慧
张跃
侯英鹂
ZOU Qi-jie;LIU Shi-hui;ZHANG Yue;HOU Ying-li(Information Engineering Faculty,Dalian University,Dalian Liaoning 116000,China;National Institute of Innovation,National Defense Science and Technology Research Center for Unmanned Systems,Changsha Hunan 410000,China)
出处
《控制理论与应用》
EI
CAS
CSCD
北大核心
2020年第8期1737-1748,共12页
Control Theory & Applications
基金
国家自然科学基金面上项目(61673084)
辽宁省自然基金项目(2019-ZD-0578)资助.