Reinforcement learning-based complete area coverage path planning for a modified htrihex robot

被引：0

作者：

Apuroop, Koppaka Ganesh Sai ^{[1
]}

Le, Anh Vu ^{[2
]}

Elara, Mohan Rajesh ^{[1
]}

Sheu, Bing J. ^{[3
]}

机构：

[1] ROAR Lab, Engineering Product Development, Singapore University of Technology and Design, Singapore,487372, Singapore

[2] Optoelectronics Research Group, Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City,700000, Viet Nam

[3] Electronics Engineering and Information Management, Chang Gung University, Taoyuan City,330, Taiwan

来源：

Sensors (Switzerland) | 2021年 / 21卷 / 04期

关键词：

Learning algorithms - Long short-term memory - Multilayer neural networks - Traveling salesman problem - Energy utilization - Motion planning - Ant colony optimization - Cleaning - Learning systems - Genetic algorithms - Robot programming;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

One of the essential attributes of a cleaning robot is to achieve complete area coverage. Current commercial indoor cleaning robots have fixed morphology and are restricted to clean only specific areas in a house. The results of maximum area coverage are sub-optimal in this case. Tiling robots are innovative solutions for such a coverage problem. These new kinds of robots can be deployed in the cases of cleaning, painting, maintenance, and inspection, which require complete area coverage. Tiling robots’ objective is to cover the entire area by reconfiguring to different shapes as per the area requirements. In this context, it is vital to have a framework that enables the robot to maximize the area coverage while minimizing energy consumption. That means it is necessary for the robot to cover the maximum area with the least number of shape reconfigurations possible. The current paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on the deep reinforcement learning technique. This framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. In this regard, a convolutional neural network (CNN) with long short term memory (LSTM) layer was trained using the actor-critic experience replay (ACER) reinforcement learning algorithm. The simulation results obtained from the current implementation were compared against the results that were generated through traditional tiling theory models that included zigzag, spiral, and greedy search schemes. The model presented in the current paper was also compared against other methods where this problem was considered as a traveling salesman problem (TSP) solved through genetic algorithm (GA) and ant colony optimization (ACO) approaches. Our proposed scheme generates a path with a minimized cost at a lesser time. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.

引用

页码：1 / 20

共 50 条

[21] Robot path planning in dynamic environment based on reinforcement learning
Zhuang, Xiao-Dong
Meng, Qing-Chun
Wei, Tian-Bin
Wang, Xu-Zhu
Tan, Rui
Li, Xiao-Jing
Journal of Harbin Institute of Technology (New Series), 2001, 8 (03) : 253 - 255
[22] A Reinforcement Learning-Based Path Planning Considering Degree of Observability
Cho, Yong Hyeon
Park, Chan Gook
PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 502 - 505
[23] Toward complete coverage planning using deep reinforcement learning by trapezoid-based transformable robot
Vo, Dinh Tung
Le, Anh Vu
Ta, Tri Duc
Tran, Minh
Duc, Phan Van
Vu, Minh Bui
Nhan, Nguyen Huu Khanh
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
[24] Deep Learning-Based Complete Coverage Path Planning With Re-Joint and Obstacle Fusion Paradigm
Lei, Tingjun
Luo, Chaomin
Jan, Gene Eu
Bi, Zhuming
FRONTIERS IN ROBOTICS AND AI, 2022, 9
[25] Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran-A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
Le, Anh Vu
Veerajagadheswar, Prabakaran
Thiha Kyaw, Phone
Elara, Mohan Rajesh
Nhan, Nguyen Huu Khanh
SENSORS, 2021, 21 (08)
[26] Path Planning of Cleaning Robot with Reinforcement Learning
Moon, Woohyeon
Park, Bumgeun
Nengroo, Sarvar Hussain
Kim, Taeyoung
Har, Dongsoo
2022 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE), 2022,
[27] Complete Coverage Path Planning of Mobile Robot Based on Dynamic Programming Algorithm
Zhou, Peng
Wang, Zhong-min
Li, Zhen-nan
Li, Yang
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC & MECHANICAL ENGINEERING AND INFORMATION TECHNOLOGY (EMEIT-2012), 2012, 23
[28] An Algorithm of Complete Coverage Path Planning for Deep-Sea Mining Vehicle Clusters Based on Reinforcement Learning
Xing, Bowen
Wang, Xiao
Liu, Zhenchong
ADVANCED THEORY AND SIMULATIONS, 2024, 7 (04)
[29] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
Han, Huiyan
Wang, Jiaqi
Kuang, Liqun
Han, Xie
Xue, Hongxin
SENSORS, 2023, 23 (12)
[30] Path planning for a robot manipulator based on probabilistic roadmap and reinforcement learning
Park, Jung-Jun
Kim, Ji-Hun
Song, Jae-Bok
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2007, 5 (06) : 674 - 680

← 1 2 3 4 5 →