Journal of Civil Aviation University of China ›› 2024, Vol. 42 ›› Issue (5): 59-65.

Previous Articles     Next Articles

Improved reinforcement learning algorithm for mobile robot path planning

ZHANG Wei 1a,2,3 , CHU Zeyuan1b , YANG Yutao1a , WANG Wei 1a
  

  1. 1a. College of Aeronautical Engineering; 1b. College of Safety Science and Engineering, CAUC, Tianjin 300300, China;
    2. Aviation Special Ground Equipment Research Base, CAAC,Tianjin 300300, China;
    3. Key Laboratory of Smart Airport Theory and System, CAAC, Guangzhou 510470, China
  • Received:2023-01-07 Revised:2023-05-04 Online:2024-12-21 Published:2025-04-08

Abstract:

Aiming at the problems of poor smoothness, slow convergence speed and low learning efficiency of the paths
planned by the traditional Q-learning algorithm, this paper proposes an improved Q-learning algorithm for mobile
robot path planning. Firstly, the density of obstacles and the relative position of the start point are considered to select the action set to accelerate the convergence speed of the Q-learning algorithm. Secondly, a continuous heuristic factor is added to the reward function, which consists of the distance between the current point and the end
point, and the distance of the current point from all the obstacles in the map as well as the boundary of the map. Finally, a scale factor is introduced into the initialization process of Q-value table to give the mobile robot with a priori environment information, and the proposed improved Q-learning algorithm is simulated and verified in a raster
map. The simulation results show that the convergence speed of the improved Q-learning algorithm is significantly
improved compared with the traditional Q-learning algorithm, and its adaptability in complex environments is better, which verifies the superiority of the improved algorithm.

Key words:

CLC Number: