Mixline: A Hybrid Reinforcement Learning Framework for Long-Horizon Bimanual Coffee Stirring Task

Cited by: 1

Authors:

Sun, Zheng [1]

Wang, Zhiqi [1]

Liu, Junjia [1]

Li, Miao [2]

Chen, Fei [1]

Affiliations:

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Wuhan Univ, Wuhan, Peoples R China

Source:

INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I | 2022 / Vol. 13455

Keywords:

Reinforcement learning; Bimanual coordination; Isaac Gym;

DOI:

10.1007/978-3-031-13844-7_58

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Bimanual activities like coffee stirring, which require coordination of both arms, are common in daily life but hard for robots to learn. Reinforcement learning is a promising approach to these tasks because it lets the robot explore how its two arms can coordinate to accomplish a shared task. However, this field faces two main challenges: the coordination mechanism and long-horizon task decomposition. We therefore propose the Mixline method, which learns sub-tasks separately with an online algorithm and then composes them with an offline algorithm using the data generated online. We constructed a learning environment based on the GPU-accelerated Isaac Gym. In our work, the bimanual robot successfully learned to grasp, hold and lift the spoon and cup, insert the spoon into the cup, and stir the coffee. The proposed method has the potential to be extended to other long-horizon bimanual tasks.

Pages: 627-636

Page count: 10

39 items in total

[1] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
Deshuai Zheng
Jin Yan
Tao Xue
Yong Liu
Journal of Intelligent & Robotic Systems, 2024, 110
[2] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
Zheng, Deshuai
Yan, Jin
Xue, Tao
Liu, Yong
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
[3] Modular Reinforcement Learning In Long-Horizon Manipulation Tasks
Vavrecka, Michal
Kriz, Jonas
Sokovnin, Nikita
Sejnova, Gabriela
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT X, 2024, 15025 : 299 - 312
[4] Bimanual Long-Horizon Manipulation Via Temporal-Context Transformer RL
Oh, Ji-Heon
Espinoza, Ismael
Jung, Danbi
Kim, Tae-Seong
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12) : 10898 - 10905
[5] Skill Learning for Long-Horizon Sequential Tasks
Alves, Joao
Lau, Nuno
Silva, Filipe
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 713 - 724
[6] A reinforcement learning approach to long-horizon operations, health, and maintenance supervisory control of advanced energy systems
Pylorof, Dimitrios
Garcia, Humberto E.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
[7] LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Cheng, Shuo
Xu, Danfei
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6451 - 6458
[8] To imitate or not to imitate: Boosting reinforcement learning-based construction robotic control for long-horizon tasks using virtual demonstrations
Huang, Lei
Zhu, Zihan
Zou, Zhengbo
AUTOMATION IN CONSTRUCTION, 2023, 146
[9] Multi-State-Space Reasoning Reinforcement Learning for Long-Horizon RFID-Based Robotic Searching and Planning Tasks
Yu Z.
Zhang J.
Mao S.
Periaswamy S.C.G.
Patton J.
Journal of Communications and Information Networks, 2022, 7 (03) : 239 - 251
[10] Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning
Li, Rui
Zou, Zhengbo
ADVANCED ENGINEERING INFORMATICS, 2023, 58
