Using Adversarial Reinforcement Learning to Improve the Resilience of Human-Robot Collaboration in Industrial Assembly

被引:0
作者
Antonelli, Dario [1 ]
Aliev, Khurshid [1 ]
机构
[1] Politecn Torino, Turin, Italy
来源
COLLABORATIVE NETWORKS IN DIGITALIZATION AND SOCIETY 5.0, PRO-VE 2023 | 2023年 / 688卷
关键词
Adversarial Reinforcement Learning; Cobots; Human-Robot Collaboration; Machine Learning;
D O I
10.1007/978-3-031-42622-3_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper proposes a novel approach to enhance the resilience of mutual collaborative activity between humans and robots in industrial assembly tasks. The approach exploits Adversarial Reinforcement Learning (ARL) to enable a robot to learn an assembly policy that is robust against human mistakes. The adversary can represent various sources of uncertainty or disturbance in the environment. By learning from adversarial feedback, the agent can improve its performance and adaptability in challenging scenarios. The paper applies ARL to the execution of the assembly task sequence. The robot acts as one agent and learns how to assist the human partner during the assembly. The agent simulating the human partner acts as the adversary and deliberately introduces mistakes during the assembly process. The robot also learns how to cope with different levels of human competence and cooperation by adjusting its own behaviour accordingly. The paper evaluates the proposed approach through experiments reproducing complex assembly sequences and compares it with baseline methods that use conventional optimization algorithms. The results show that ARL does not outperforms conventional optimization algorithms in terms of task completion time but guarantee robustness against human mistakes. The paper also discusses the implications for human-robot collaboration and suggests future directions for research.
引用
收藏
页码:317 / 327
页数:11
相关论文
共 30 条
[1]   Task-based Programming and Sequence Planning for Human-Robot Collaborative Assembly [J].
Aliev, Khurshid ;
Antonelli, Dario ;
Bruno, Giulia .
IFAC PAPERSONLINE, 2019, 52 (13) :1638-1643
[2]   Robust Assembly Sequence Generation in a Human-Robot Collaborative Workcell by Reinforcement Learning [J].
Antonelli, Dario ;
Zeng, Qingfei ;
Aliev, Khurshid ;
Liu, Xuemei .
FME TRANSACTIONS, 2021, 49 (04) :851-858
[3]   Optimizing human-robot task allocation using a simulation tool based on standardized work descriptions [J].
Baenziger, Timo ;
Kunz, Andreas ;
Wegener, Konrad .
JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (07) :1635-1648
[4]   Analysing Factory Workers' Acceptance of Collaborative Robots: A Web-Based Tool for Company Representatives [J].
Baumgartner, Marco ;
Kopp, Tobias ;
Kinkel, Steffen .
ELECTRONICS, 2022, 11 (01)
[5]  
BRAGANCA S., 2019, Occupational and Environmental Safety and Health, P641, DOI [10.1007/978-3-030-14730-368, DOI 10.1007/978-3-030-14730-368]
[6]   Dynamic task classification and assignment for the management of human-robot collaborative teams in workcells [J].
Bruno, Giulia ;
Antonelli, Dario .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 98 (9-12) :2415-2427
[7]   Multi-Agent Reinforcement Learning: A Review of Challenges and Applications [J].
Canese, Lorenzo ;
Cardarilli, Gian Carlo ;
Di Nunzio, Luca ;
Fazzolari, Rocco ;
Giardino, Daniele ;
Re, Marco ;
Spano, Sergio .
APPLIED SCIENCES-BASEL, 2021, 11 (11)
[8]   Striatal gray matter volumes, externalizing traits, and N-back task performance: An exploratory study of sex differences using the human connectome project data [J].
Chen, Yu ;
Li, Chiang-Shan R. .
JOURNAL OF EXPERIMENTAL PSYCHOPATHOLOGY, 2022, 13 (01) :1-11
[9]   Strategic View on Cobot Deployment in Assembly 4.0 Systems [J].
Cohen, Yuval ;
Shoval, Shraga ;
Faccio, Maurizio .
IFAC PAPERSONLINE, 2019, 52 (13) :1519-1524
[10]   AND OR GRAPH REPRESENTATION OF ASSEMBLY PLANS [J].
DEMELLO, LSH ;
SANDERSON, AC .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1990, 6 (02) :188-199