Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning

被引：8

作者：

Li, Rui ^{[1
]}

Zou, Zhengbo ^{[1
]}

机构：

[1] Univ British Columbia, Dept Civil Engn, Vancouver, BC, Canada

来源：

ADVANCED ENGINEERING INFORMATICS | 2023年 / 58卷

关键词：

Reinforcement learning; Construction robot; Generative adversarial imitation learning; Virtual reality;

D O I：

10.1016/j.aei.2023.102140

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The development and deployment of robots on construction sites are integral to the industrialization of construction, known as Construction 4.0. Tele-operated and pre-programmed robots have enhanced construction efficiency and safety. However, their utilization on-site remains limited due to the need for expert remote control and the lack of adaptability in dynamic environments. Reinforcement learning (RL) has emerged as a promising solution, as RL-controlled robots possess inherent self-learning abilities to adapt to diverse situations. Nevertheless, manual design of RL reward functions for complex tasks poses challenges. To address this issue, inverse reinforcement learning (IRL) methods, such as Generative Adversarial Imitation Learning (GAIL), have been proposed to learn optimal actions through expert demonstration and self-exploration, without explicitly defined reward functions. In this study, we propose an innovative approach integrating GAIL and virtual reality (VR) integrated robot control approach to control robots for long-horizon collaborative construction tasks involving multiple sub-tasks. We employ VR expert demonstrations as input for GAIL training, enabling a team of robots, including an Unmanned Ground Vehicle (UGV) and two robot arms, to interact with the designed RL environment and perform tasks such as transporting, picking, and installing window panels. Handle long-horizon collaborative construction tasks (i.e., a long sequence of several sub-tasks performed by multiple robots). For evaluation, we compare the performance of our VR-GAIL model with a prevalent and robust RL baseline model, Proximal Policy Optimization (PPO). The results demonstrate that our reward-free VR-GAIL model achieves, on average, a 4.5% higher success rate than the PPO counterpart equipped with carefully designed reward functions across all three sub-tasks and their randomized variations. Furthermore, the performance gap between GAIL and PPO widens as the task difficulty increases. These findings indicate that our approach effectively enhances RL agent performance in tackling complex construction tasks while expediting development by eliminating reward function design requirements.

引用

页数：12

共 67 条

[1] Robotic assembly of timber joints using reinforcement learning
Apolinarska, Aleksandra Anna
Pacher, Matteo
Li, Hui
Cote, Nicholas
Pastrana, Rafael
Gramazio, Fabio
Kohler, Matthias
[J]. AUTOMATION IN CONSTRUCTION, 2021, 125
[2] Atanasova L., 2020, ACADIA 2020 DISTRIBU, P350
[3] Robotic architectural assembly with tactile skills: Simulation and optimization
Belousov, Boris
Wibranek, Bastian
Schneider, Jan
Schneider, Tim
Chalvatzaki, Georgia
Peters, Jan
Tessmann, Oliver
[J]. AUTOMATION IN CONSTRUCTION, 2022, 133
[4] Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning
Bing, Zhenshan
Lemke, Christian
Cheng, Long
Huang, Kai
Knoll, Alois
[J]. NEURAL NETWORKS, 2020, 129 : 323 - 333
[5] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
[6] Structural rigidity theory applied to the scaffold-free (dis)assembly of space frames using cooperative robotics
Bruun E.P.G.
Adriaenssens S.
Parascho S.
[J]. Automation in Construction, 2022, 141
[7] Three cooperative robotic fabrication methods for the scaffold-free construction of a masonry arch
Bruun, Edvard P. G.
Pastrana, Rafael
Paris, Vittorio
Beghini, Alessandro
Pizzigoni, Attilio
Parascho, Stefana
Adriaenssens, Sigrid
[J]. AUTOMATION IN CONSTRUCTION, 2021, 129 (129)
[8] PRE-PROGRAMMED ROBOTIC OSTEOTOMIES FOR FIBULA FREE FLAP MANDIBLE RECONSTRUCTION: A PRECLINICAL INVESTIGATION
Chao, Albert H.
Weimer, Katie
Raczkowsky, Joerg
Zhang, Yaokun
Kunze, Mirko
Cody, Dianna
Selber, Jesse C.
Hanasono, Matthew M.
Skoracki, Roman J.
[J]. MICROSURGERY, 2016, 36 (03) : 246 - 249
[9] Chen J., 2022, arXiv, DOI 10.48550/arXiv.2204.01975
[10] Chi H.L., 2014, Optimization and Evaluation of Automatic Rigging Path Guidance for Tele-Operated Construction Crane

← 1 2 3 4 5 6 7 →