Consensus-based Normalizing-Flow Control: A Case Study in Learning Dual-Arm Coordination

被引:3
作者
Yin, Hang [1 ]
Verginis, Christos K. [2 ]
Kragic, Danica [1 ]
机构
[1] KTH Royal Inst Technol, CAS RPL, Stockholm, Sweden
[2] Uppsala Univ, Div Signals & Syst, Dept Elect Engn, Uppsala, Sweden
来源
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年
关键词
LEADER-FOLLOWER CONSENSUS; MULTIAGENT SYSTEMS; MANIPULATION;
D O I
10.1109/IROS47612.2022.9981827
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop two consensus-based learning algorithms for multi-robot systems applied on complex tasks involving collision constraints and force interactions, such as the cooperative peg-in-hole placement. The proposed algorithms integrate multi-robot distributed consensus and normalizingflow-based reinforcement learning. The algorithms guarantee the stability and the consensus of the multi-robot system's generalized variables in a transformed space. This transformed space is obtained via a diffeomorphic transformation parameterized by normalizing-flow models that the algorithms use to train the underlying task, learning hence skillful, dexterous trajectories required for the task accomplishment. We validate the proposed algorithms by parameterizing reinforcement learning policies, demonstrating efficient cooperative learning, and strong generalization of dual-arm assembly skills in a dynamics-engine simulator.
引用
收藏
页码:10417 / 10424
页数:8
相关论文
共 47 条
[1]   Impedance-Based Collision Reaction Strategy via Internal Stress Loading in Cooperative Manipulation [J].
Aladele, Victor ;
Hutchinson, Seth .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :5837-5843
[2]  
Almeida D., 2019, P 2019 IEEE 58 C DEC, P4252
[3]  
Bais AZ, 2015, IEEE INT C INT ROBOT, P2380, DOI 10.1109/IROS.2015.7353699
[4]   Decentralized Robust Synchronization of Unknown High Order Nonlinear Multi-Agent Systems With Prescribed Transient and Steady State Performance [J].
Bechlioulis, Charalampos P. ;
Rovithakis, George A. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (01) :123-134
[5]  
Bicchi A., 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065), P348, DOI 10.1109/ROBOT.2000.844081
[6]  
Coumans E., 2016, Pybullet, a Python module for physics simulation for games, robotics and machine learning
[7]   Distributed Event-Triggered Control for Multi-Agent Systems [J].
Dimarogonas, Dimos V. ;
Frazzoli, Emilio ;
Johansson, Karl H. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (05) :1291-1297
[8]  
Dinh L., 2017, P ICLR
[9]   Internal Force Analysis and Load Distribution for Cooperative Multi-Robot Manipulation [J].
Erhart, Sebastian ;
Hirche, Sandra .
IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (05) :1238-1243
[10]   Planning and control for cooperative manipulation and transportation with aerial robots [J].
Fink, Jonathan ;
Michael, Nathan ;
Kim, Soonkyum ;
Kumar, Vijay .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (03) :324-334