Reinforcement learning-based complete area coverage path planning for a modified hTrihex robot

Authors
Apuroop, Koppaka Ganesh Sai [1]
Le, Anh Vu [2]
Elara, Mohan Rajesh [1]
Sheu, Bing J. [3]
Affiliations
[1] ROAR Lab, Engineering Product Development, Singapore University of Technology and Design, Singapore, 487372, Singapore
[2] Optoelectronics Research Group, Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City, 700000, Viet Nam
[3] Electronics Engineering and Information Management, Chang Gung University, Taoyuan City, 330, Taiwan
Source
Sensors (Switzerland) | 2021 / Vol. 21 / Issue 04
Keywords
Learning algorithms - Long short-term memory - Multilayer neural networks - Traveling salesman problem - Energy utilization - Motion planning - Ant colony optimization - Cleaning - Learning systems - Genetic algorithms - Robot programming;
DOI
Not available
Abstract
Complete area coverage is an essential attribute of a cleaning robot. Current commercial indoor cleaning robots have a fixed morphology and can clean only specific areas of a house, so the area coverage they achieve is sub-optimal. Tiling robots are an innovative solution to this coverage problem. They can be deployed for cleaning, painting, maintenance, and inspection tasks, all of which require complete area coverage. A tiling robot's objective is to cover the entire area by reconfiguring into different shapes as the area requires. In this context, it is vital to have a framework that enables the robot to maximize area coverage while minimizing energy consumption; that is, the robot must cover the maximum area with the fewest possible shape reconfigurations. This paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on deep reinforcement learning. The framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. To this end, a convolutional neural network (CNN) with a long short-term memory (LSTM) layer was trained using the actor-critic with experience replay (ACER) reinforcement learning algorithm. The simulation results were compared against those of traditional tiling theory models, including zigzag, spiral, and greedy search schemes. The model was also compared against methods that treat the problem as a traveling salesman problem (TSP) solved with genetic algorithm (GA) and ant colony optimization (ACO) approaches. The proposed scheme generates a minimized-cost path in less time. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
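As a rough illustration of two of the baseline schemes named in the abstract, the sketch below orders the cells of a small grid with a zigzag sweep and with a greedy nearest-cell search, then scores each ordering by total travel distance. It rests on simplifying assumptions that are not from the paper: the workspace is a plain square grid rather than hexagonal hTrihex tiles, cost is Manhattan distance only, and shape reconfigurations are not modeled. The function names (`zigzag_order`, `greedy_order`, `path_cost`) are hypothetical.

```python
from itertools import product


def zigzag_order(rows, cols):
    """Visit every cell row by row, reversing direction on alternate rows."""
    order = []
    for r in range(rows):
        col_range = range(cols) if r % 2 == 0 else range(cols - 1, -1, -1)
        order.extend((r, c) for c in col_range)
    return order


def manhattan(a, b):
    """Manhattan distance between two grid cells (assumed cost model)."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def greedy_order(rows, cols, start=(0, 0)):
    """Repeatedly move to the nearest unvisited cell; ties broken by cell index."""
    unvisited = set(product(range(rows), range(cols)))
    unvisited.discard(start)
    order = [start]
    while unvisited:
        current = order[-1]
        nearest = min(unvisited, key=lambda p: (manhattan(current, p), p))
        unvisited.remove(nearest)
        order.append(nearest)
    return order


def path_cost(order):
    """Total travel distance along a visiting order."""
    return sum(manhattan(a, b) for a, b in zip(order, order[1:]))


if __name__ == "__main__":
    rows, cols = 3, 4
    for name, order in [("zigzag", zigzag_order(rows, cols)),
                        ("greedy", greedy_order(rows, cols))]:
        print(name, "covers", len(set(order)), "cells, cost", path_cost(order))
```

On an obstacle-free rectangle both orderings visit every cell exactly once; differences between such schemes only start to show up once obstacles, irregular boundaries, or reconfiguration costs are added, which is the setting the paper's learned planner targets.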
Pages: 1 / 20