必威(betway·官方网站)西汉姆联-EURO CUP

Online System

Reviewer Login

Editor Login

Author Login

Download

Statement of Competing Interests

Authors Contribution Form

Online Journal

Advanced search

Special subject

Current Issue

Previous Issue

Introduction

Bimonthly, started in 1957
Administrator
Shanxi Provincial Education Department
Sponsor
Taiyuan University of Technology
Publisher
Ed. Office of Journal of TYUT
Editor-in-Chief
SUN Hongbin
ISSN: 1007-9432
CN: 14-1220/N

Links

Shanxi Provincial Education Department

Taiyuan University of Technology

location: home > paper >

References:

MAO Guojun GU Shimin.An Improved Q-Learning Algorithm and Its Application in Path Planning[J].Taiyuan University of technology,2021,52(01):91-97

PDFdownloadsize：631KBviewed：download：

An Improved Q-Learning Algorithm and Its Application in Path Planning

DOI:

10.16355/j.cnki.issn1007-9432tyut.2021.01.012

Received:

Accepted:

Corresponding author		Institute
MAO Guojun		Institute of Machine Learning and Intelligent Science,Fujian Universtiy of Technology

abstract:

Traditional Q-Learning algorithm has the problems of too many random searches and slow convergence speed. Therefore, in this paper an improved ε-Q-Learning algorithm based on traditional Q-Learning algorithm was propased and applied to path planning. The key of this method is to introduce the dynamic search factor technology, which adjusts the greedy factor dynamically according to the feedback of the environment. If one exploration from the beginning to the end fails, the randomicity of the next exploration will be increased by increasing greedy factor, in order to avoid falling into the local optimization dilemma. Conversely, purpose will be increased by reducing greedy factor. The performance of the algorithm is evaluated by loss function, running efficiency, number of steps, and total return. Experiments show that compared with the existing Q-Learning algorithm, ε-Q-Learning can not only find a better optimal path, but also significantly reduce the cost of iterative searching.

Keywords:

path planning; artificial intelligence; reinforcement learning; Q-Learning;