基于深度强化学习的履带机器人摆臂控制方法

Flipper Control Method for Tracked Robot Based on Deep Reinforcement Learning

下载PDF

导出

摘要摆臂式履带机器人具有一定的地形适应能力,实现摆臂的自主控制对提升机器人在复杂环境中的智能化作业水平具有重要意义。结合专家越障知识和技术指标对机器人的摆臂控制问题进行马尔可夫决策过程(Markov decision process,MDP)建模,基于物理仿真引擎Pymunk搭建了越障训练的仿真环境;提出一种基于D3QN(dueling double DQN)网络模型的深度强化学习摆臂控制算法,以地形信息与机器人状态为输入,以机器人前后四摆臂转角为输出,能够实现挑战性地形下履带机器人摆臂的自学习控制。在Gazebo三维仿真环境中将算法学得的控制策略与人工操纵进行了对比实验,结果表明:所提算法相对人工操纵具有更加高效的复杂地形通行能力。 Tracked robots with flippers have certain terrain adaptation capabilities.To improve the intelligent operation level of robots in complex environments,it is significant to realize the flipper autonomously control.Combining the expert experience in obstacle crossing and optimization indicators,Markov decision process(MDP)modeling of the robot's flipper control problem is carried out and a simulation training environment based on physics simulation engine Pymunk is built.A deep reinforcement learning control algorithm based on dueling double DQN(D3QN)network is proposed for controlling the flippers.With terrain information and robot state as the input and the four flippers'angle as the output,the algorithm can achieve the self-learning control of the flippers in challenging terrain.The learned flipper control policy is compared with the manual operation in Gazebo 3D simulation environment.The results show that the proposed algorithm can enable the flippers of robot to obtain adaptive adjustment ability,which helps the robot pass complex terrain more efficiently.

作者潘海南陈柏良黄开宏任君凯程创卢惠民张辉 Pan Hainan;Chen Bailiang;Huang Kaihong;Ren Junkai;Cheng Chuang;Lu Huimin;Zhang Hui(College of Intelligence Science and Technology,National University of Defense Technology,Changsha 410073,China)

机构地区国防科技大学智能科学学院

出处《系统仿真学报》 CAS CSCD 北大核心 2024年第2期405-414,共10页 Journal of System Simulation

基金国家自然科学基金联合基金重点项目(U1813205,U1913202)。

关键词履带机器人摆臂自主控制自主越障深度强化学习机器人操作 tracked robot flipper autonomous control autonomous traversal DRL robot operation

分类号 TP242.6 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献2

1商德勇..薄煤层综采工作面巡检机器人运动分析及试验研究[D].中国矿业大学(北京),2016:
2李允旺,葛世荣,朱华,刘建.四履带双摆臂机器人越障机理及越障能力[J].机器人,2010,32(2):157-165. 被引量：93

二级参考文献6

1iRobot Corporation. Small Unmanned Ground Vehicle (SUGV) [EB/OL]. (2008-05-12) [2009-03-18]. http://www.irobot.com/ sp.cfm?pageid=219. 被引量：1
2Chiba Institute of Technology. Hibiscus, The New Rescue Robot[EB/OL]. (2006-06-05) [2009-03-18]. http://www. techfresh.net/hibiscus-the-new-rescue-robot. 被引量：1
3Chen C X, Trivedi M M. Reactive locomotion control of articulated-tracked mobile robots for obstacle negotiation[C]// IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway, NJ, USA: IEEE, 1993: 1349-1356. 被引量：1
4Choi B S, Song S M. Fully automated obstacle-crossing gaits for walking machines[J]. IEEE Transactions on Systems, Man and Cybernetics, 1998, 18(6): 952-964. 被引量：1
5Liu J G, Wang Y C, Ma S G, et al. Analysis of stairs-climbing ability for a tracked reconfigurable modular robot[C]//IEEE International Workshop on Safety, Security and Rescue Robotics. Piscataway, NJ, USA: IEEE, 2005: 36-41. 被引量：1
6信建国,李小凡,王忠,姚辰,原培章.履带腿式非结构环境移动机器人特性分析[J].机器人,2004,26(1):35-39. 被引量：49

共引文献92

1王鹏,张驰,曾招财,叶剑锋,余修毕.摆臂履带机器人研发及在箱涵检测中的应用[J].给水排水,2023,49(S02):809-813.
2张勤,王曼莉,黄小刚,余晓鑫,田联房.形状可变救助型机器人的结构及其越障能力的分析与实验[J].制造业自动化,2011,33(15):7-12. 被引量：1
3付宜利,李海泓,张福海.轮履变结构反恐机器人设计及运动学分析[J].机械与电子,2011,29(10):66-69. 被引量：9
4王建,谈英姿,许映秋.基于姿态的多关节履带机器人越障控制[J].东南大学学报（自然科学版）,2011,41(B09):160-167. 被引量：12
5邓乐,赵莉莉.煤矿井下搜救机器人越障分析[J].河南理工大学学报（自然科学版）,2011,30(5):567-570. 被引量：6
6徐如强,韩宝玲,罗庆生,何灏.六履带机器人结构参数与越障性能的关系[J].机械设计与制造,2012(7):113-115. 被引量：16
7孟宪春,孟广柱,张建华,谷德明.八轮腿移动机器人平台越障性能研究[J].河北工业大学学报,2012,41(6):35-39. 被引量：1
8邓乐,谢国周.串联式履带机器人越障性能研究[J].河南理工大学学报（自然科学版）,2013,32(1):56-61. 被引量：3
9刘少刚,郭云龙,贾鹤鸣,林珊颖,赵丹.履带自张紧式主臂可变构型机器人机构原理与越障分析[J].中南大学学报（自然科学版）,2013,44(6):2289-2297. 被引量：12
10魏毅龙,程志红.适应性调整轮径式车轮的设计及越障性能分析[J].机械传动,2013,37(7):125-129. 被引量：2

1曹学伟.浅析越障训练教学[J].灌篮,2020(27):33-33.
2张智凌.探究装配式建筑施工技术在建筑工程中的应用[J].中文科技期刊数据库（文摘版）工程技术,2024(1):0013-0016.
3彭慧.数字化转型指标体系——促进物流企业数字化转型的评估和实施[J].物流工程与管理,2024,46(1):162-165. 被引量：2
4张兴国.市政道路附属设施的质量控制探究[J].大众标准化,2024(1):34-35. 被引量：1
5郑建.消防应急救援技术与训练标准化工作的开展对策探讨[J].大众标准化,2024(1):36-38.
6常波峰,黄伟,刘宽,王佳,吴海英,邹元杰,黄雪赞,陈前炜,陈卫红,王冬明.陕西省某煤矿企业噪声职业健康风险评估[J].公共卫生与预防医学,2024,35(1):70-73. 被引量：1

系统仿真学报

2024年第2期

浏览历史

内容加载中请稍等...

基于深度强化学习的履带机器人摆臂控制方法

参考文献2

二级参考文献6

共引文献92

相关作者

相关机构

相关主题

浏览历史