Reinforcement Learning-Based Safe Path Planning for a 3R Planar Robot

Home Page
About
Submit A Journal
Submit A Conference
Submit Paper/Book
- Submit a Preprint
- Submit a Book
Contact

Sakarya Üniversitesi Fen Bilimleri Enstitüsü Dergisi
Volume:26 Issue:1
Reinforcement Learning-Based Safe Path Planning for a 3R Planar Robot

Reinforcement Learning-Based Safe Path Planning for a 3R Planar Robot

Authors : Mustafa Can BİNGOL

Pages : 128-135

Doi:10.16984/saufenbilder.911942

View : 116 | Download : 12

Publication Date : 2022-02-28

Article Type : Research Paper

Abstract :Path planning is an essential topic of robotics studies. Robotic researchers have suggested some methods such as particle swarm optimization, A*, and reinforcement learning insert ignore into journalissuearticles values(RL); to obtain a path. In the current study, it was aimed to generate RL-based safe path planning for a 3R planar robot. For this purpose, firstly, the environment was performed. Later, state, action, reward, and terminate functions were determined. Lastly, actor and critic artificial neural networks insert ignore into journalissuearticles values(ANN);, which are basic components of deep deterministic policy gradients insert ignore into journalissuearticles values(DDPG);, were formed in order to generate a safe path. Another aim of the current study was to obtain an optimum actor ANN. Different ANN structures that have 2, 4, and 8-layers and 512, 1024, 2048, and 4096-units were formed to get an optimum actor ANN. These formed ANN structures were trained during 5000 episodes and 200 steps and the best results were obtained by 4-layer, 1024, and 2048-units structures. Owing to this reason, 4 different ANN structures were performed utilizing 4-layer, 1024, and 2048-units. The proposed structures were trained. The NET-M2U-4L structure generated the best result among 4 different proposed structures. The NET-M2U-4L structure was tested by using 1000 different scenarios. As a result of the tests, the rate of generating a safe path was calculated as 93.80% and the rate of colliding to the obstacle was computed as 1.70%. As a consequence, a safe path was planned and an optimum actor ANN was obtained for a 3R planar robot.
Keywords : artificial neural networks, Deep Deterministic Policy Gradients, path planning, reinforcement learning

ORIGINAL ARTICLE URL

* There may have been changes in the journal, article,conference, book, preprint etc. informations. Therefore, it would be appropriate to follow the information on the official page of the source. The information here is shared for informational purposes. IAD is not responsible for incorrect or missing information.