Mixline: A Hybrid Reinforcement Learning Framework for Long-Horizon Bimanual Coffee Stirring Task

被引:1
|
作者
Sun, Zheng [1 ]
Wang, Zhiqi [1 ]
Liu, Junjia [1 ]
Li, Miao [2 ]
Chen, Fei [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Wuhan Univ, Wuhan, Peoples R China
来源
INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I | 2022年 / 13455卷
关键词
Reinforcement learning; Bimanual coordination; Isaac Gym;
D O I
10.1007/978-3-031-13844-7_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bimanual activities like coffee stirring, which require coordination of dual arms, are common in daily life and intractable to learn by robots. Adopting reinforcement learning to learn these tasks is a promising topic since it enables the robot to explore how dual arms coordinate together to accomplish the same task. However, this field has two main challenges: coordination mechanism and long-horizon task decomposition. Therefore, we propose the Mixline method to learn sub-tasks separately via the online algorithm and then compose them together based on the generated data through the offline algorithm. We constructed a learning environment based on the GPU-accelerated Isaac Gym. In our work, the bimanual robot successfully learned to grasp, hold and lift the spoon and cup, insert them together and stir the coffee. The proposed method has the potential to be extended to other long-horizon bimanual tasks.
引用
收藏
页码:627 / 636
页数:10
相关论文
共 39 条
  • [1] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
    Deshuai Zheng
    Jin Yan
    Tao Xue
    Yong Liu
    Journal of Intelligent & Robotic Systems, 2024, 110
  • [2] State-Dependent Maximum Entropy Reinforcement Learning for Robot Long-Horizon Task Learning
    Zheng, Deshuai
    Yan, Jin
    Xue, Tao
    Liu, Yong
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
  • [3] Modular Reinforcement Learning In Long-Horizon Manipulation Tasks
    Vavrecka, Michal
    Kriz, Jonas
    Sokovnin, Nikita
    Sejnova, Gabriela
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT X, 2024, 15025 : 299 - 312
  • [4] Bimanual Long-Horizon Manipulation Via Temporal-Context Transformer RL
    Oh, Ji-Heon
    Espinoza, Ismael
    Jung, Danbi
    Kim, Tae-Seong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 10898 - 10905
  • [5] Skill Learning for Long-Horizon Sequential Tasks
    Alves, Joao
    Lau, Nuno
    Silva, Filipe
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 713 - 724
  • [6] A reinforcement learning approach to long-horizon operations, health, and maintenance supervisory control of advanced energy systems
    Pylorof, Dimitrios
    Garcia, Humberto E.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [7] LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
    Cheng, Shuo
    Xu, Danfei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6451 - 6458
  • [8] To imitate or not to imitate: Boosting reinforcement learning-based construction robotic control for long-horizon tasks using virtual demonstrations
    Huang, Lei
    Zhu, Zihan
    Zou, Zhengbo
    AUTOMATION IN CONSTRUCTION, 2023, 146
  • [9] Multi-State-Space Reasoning Reinforcement Learning for Long-Horizon RFID-Based Robotic Searching and Planning Tasks
    Yu Z.
    Zhang J.
    Mao S.
    Periaswamy S.C.G.
    Patton J.
    Journal of Communications and Information Networks, 2022, 7 (03) : 239 - 251
  • [10] Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning
    Li, Rui
    Zou, Zhengbo
    ADVANCED ENGINEERING INFORMATICS, 2023, 58