Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning

被引:8
作者
Li, Rui [1 ]
Zou, Zhengbo [1 ]
机构
[1] Univ British Columbia, Dept Civil Engn, Vancouver, BC, Canada
关键词
Reinforcement learning; Construction robot; Generative adversarial imitation learning; Virtual reality;
D O I
10.1016/j.aei.2023.102140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development and deployment of robots on construction sites are integral to the industrialization of construction, known as Construction 4.0. Tele-operated and pre-programmed robots have enhanced construction efficiency and safety. However, their utilization on-site remains limited due to the need for expert remote control and the lack of adaptability in dynamic environments. Reinforcement learning (RL) has emerged as a promising solution, as RL-controlled robots possess inherent self-learning abilities to adapt to diverse situations. Nevertheless, manual design of RL reward functions for complex tasks poses challenges. To address this issue, inverse reinforcement learning (IRL) methods, such as Generative Adversarial Imitation Learning (GAIL), have been proposed to learn optimal actions through expert demonstration and self-exploration, without explicitly defined reward functions. In this study, we propose an innovative approach integrating GAIL and virtual reality (VR) integrated robot control approach to control robots for long-horizon collaborative construction tasks involving multiple sub-tasks. We employ VR expert demonstrations as input for GAIL training, enabling a team of robots, including an Unmanned Ground Vehicle (UGV) and two robot arms, to interact with the designed RL environment and perform tasks such as transporting, picking, and installing window panels. Handle long-horizon collaborative construction tasks (i.e., a long sequence of several sub-tasks performed by multiple robots). For evaluation, we compare the performance of our VR-GAIL model with a prevalent and robust RL baseline model, Proximal Policy Optimization (PPO). The results demonstrate that our reward-free VR-GAIL model achieves, on average, a 4.5% higher success rate than the PPO counterpart equipped with carefully designed reward functions across all three sub-tasks and their randomized variations. Furthermore, the performance gap between GAIL and PPO widens as the task difficulty increases. These findings indicate that our approach effectively enhances RL agent performance in tackling complex construction tasks while expediting development by eliminating reward function design requirements.
引用
收藏
页数:12
相关论文
共 67 条
  • [1] Robotic assembly of timber joints using reinforcement learning
    Apolinarska, Aleksandra Anna
    Pacher, Matteo
    Li, Hui
    Cote, Nicholas
    Pastrana, Rafael
    Gramazio, Fabio
    Kohler, Matthias
    [J]. AUTOMATION IN CONSTRUCTION, 2021, 125
  • [2] Atanasova L., 2020, ACADIA 2020 DISTRIBU, P350
  • [3] Robotic architectural assembly with tactile skills: Simulation and optimization
    Belousov, Boris
    Wibranek, Bastian
    Schneider, Jan
    Schneider, Tim
    Chalvatzaki, Georgia
    Peters, Jan
    Tessmann, Oliver
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 133
  • [4] Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning
    Bing, Zhenshan
    Lemke, Christian
    Cheng, Long
    Huang, Kai
    Knoll, Alois
    [J]. NEURAL NETWORKS, 2020, 129 : 323 - 333
  • [5] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
  • [6] Structural rigidity theory applied to the scaffold-free (dis)assembly of space frames using cooperative robotics
    Bruun E.P.G.
    Adriaenssens S.
    Parascho S.
    [J]. Automation in Construction, 2022, 141
  • [7] Three cooperative robotic fabrication methods for the scaffold-free construction of a masonry arch
    Bruun, Edvard P. G.
    Pastrana, Rafael
    Paris, Vittorio
    Beghini, Alessandro
    Pizzigoni, Attilio
    Parascho, Stefana
    Adriaenssens, Sigrid
    [J]. AUTOMATION IN CONSTRUCTION, 2021, 129 (129)
  • [8] PRE-PROGRAMMED ROBOTIC OSTEOTOMIES FOR FIBULA FREE FLAP MANDIBLE RECONSTRUCTION: A PRECLINICAL INVESTIGATION
    Chao, Albert H.
    Weimer, Katie
    Raczkowsky, Joerg
    Zhang, Yaokun
    Kunze, Mirko
    Cody, Dianna
    Selber, Jesse C.
    Hanasono, Matthew M.
    Skoracki, Roman J.
    [J]. MICROSURGERY, 2016, 36 (03) : 246 - 249
  • [9] Chen J., 2022, arXiv, DOI 10.48550/arXiv.2204.01975
  • [10] Chi H.L., 2014, Optimization and Evaluation of Automatic Rigging Path Guidance for Tele-Operated Construction Crane