中国民航大学学报 ›› 2023, Vol. 41 ›› Issue (1): 52-57.

• 民用航空 • 上一篇    下一篇

基于强化学习的模块机器人故障自修复方法

管恩广1,2,王尧1,曹家彬2,赵言正2   

  1. (1. 上海海事大学物流工程学院,上海 201306; 2. 上海交通大学机械与动力工程学院,上海 200240)
  • 收稿日期:2022-01-08 修回日期:2022-04-18 出版日期:2023-10-29 发布日期:2023-10-29
  • 作者简介:管恩广(1983—),男,黑龙江哈尔滨人,讲师,博士,研究方向为移动机器人控制理论方法.
  • 基金资助:
    国家自然科学基金青年科学基金项目(61806124)

Fault self-repair method of modular robot based on reinforcement learning algorithm

Guan Enguang1,2 , Wang Yao1 , Cao Jiabin2 , Zhao Yanzheng2   

  1. (1. Logistics Engineering College, Shanghai Maritime University, Shanghai 201306, China;2. School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China)
  • Received:2022-01-08 Revised:2022-04-18 Online:2023-10-29 Published:2023-10-29

摘要: 针对晶格式模块机器人的鲁棒性设计问题,提出一种基于强化学习算法的故障自修复方法。该方法将系统内以空位填充为目标的自修复过程转化为以包含空位的子群模块运动为手段的系统自重构过程。同时,基于强化学习算法提出一种离散方式下的空位移动路径规划,并按此引导空位在系统内穿行。仿真试验结果表明,该故障自修复方法的有效性在晶格式模块机器人系统上得到了验证,且可广泛应用于其他同构模块机器人系统

关键词: 模块机器人, 自重构, 强化学习算法, 故障自修复

Abstract: A fault self-repair method based on reinforcement learning algorithm is proposed for the robust design issue of lattice modular robot. The method transforms the self-repair process with the goal of filling empty spaces into a self-reconfiguration process by means of the movements of meta-modules containing empty nodes. Based on the reinforcement learning algorithm, a discrete path planning for the movement of empty nodes is proposed, and the empty nodes are accordingly guided to travel through the system. The simulation test results show that the effectiveness of this fault self-repair method is verified on the lattice modular robotic system and can be widely applied to other isomorphic modular robotic systems

Key words: modular robot, self-reconfiguration, reinforcement learning algorithm, fault self-repair

中图分类号: