An imitation learning framework for generating multi-modal trajectories from unstructured demonstrations

被引:3
|
作者
Peng, Jian-Wei [1 ]
Hu, Min-Chun [2 ]
Chu, Wei-Ta [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
Trajectory generation; Motion synthesis; Imitation learning; Reinforcement learning; Generative adversarial networks; HUMAN MOTION PREDICTION;
D O I
10.1016/j.neucom.2022.05.076
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main challenge of the trajectory generation problem is to generate long-term as well as diverse tra-jectories. Generative Adversarial Imitation Learning (GAIL) is a well-known model-free imitation learning algorithm that can be utilized to generate trajectory data, while vanilla GAIL would fail to capture multi -modal demonstrations. Recent methods propose latent variable models to solve this problem; however, previous works may have a mode missing problem. In this work, we propose a novel method to generate long-term trajectories that are controllable by a continuous latent variable based on GAIL and a condi-tional Variational Autoencoder (cVAE). We further assume that subsequences of the same trajectory should be encoded to similar locations in the latent space. Therefore, we introduce a contrastive loss in the training of the encoder. In our motion synthesis task, we propose to first construct a low-dimensional motion manifold by using a VAE to reduce the burden of our imitation learning model. Our experimental results show that the proposed model outperforms the state-of-the-art methods and can be applied to motion synthesis.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:712 / 723
页数:12
相关论文
共 41 条
  • [31] A reinforcement learning-based neighborhood search operator for multi-modal optimization and its applications
    Hong, Jiale
    Shen, Bo
    Pan, Anqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [32] Learning Category-Level Generalizable Object Manipulation Policy Via Generative Adversarial Self-Imitation Learning From Demonstrations
    Shen, Hao
    Wan, Weikang
    Wang, He
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11166 - 11173
  • [33] Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network
    Chen, Xingyu
    Li, Jin
    Lan, Xuguang
    Zheng, Nanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 177 - 187
  • [34] A whole-process interpretable and multi-modal deep reinforcement learning for diagnosis and analysis of Alzheimer's disease *
    Zhang, Quan
    Du, Qian
    Liu, Guohua
    JOURNAL OF NEURAL ENGINEERING, 2021, 18 (06)
  • [35] Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations
    Zhang, Hanxuan
    Huo, Ju
    Huang, Yulong
    Cheng, Jiajun
    Li, Xiaofeng
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03): : 3248 - 3260
  • [36] AW3M: An auto-weighting and recovery framework for breast cancer diagnosis using multi-modal ultrasound
    Huang, Ruobing
    Lin, Zehui
    Dou, Haoran
    Wang, Jian
    Miao, Juzheng
    Zhou, Guangquan
    Jia, Xiaohong
    Xu, Wenwen
    Mei, Zihan
    Dong, Yijie
    Yang, Xin
    Zhou, Jianqiao
    Ni, Dong
    MEDICAL IMAGE ANALYSIS, 2021, 72
  • [37] Alternative multi-label imitation learning framework monitoring tool wear and bearing fault under different working conditions
    Wang, Zisheng
    Xuan, Jianping
    Shi, Tielin
    ADVANCED ENGINEERING INFORMATICS, 2022, 54
  • [38] Improved Cooperative Multi-agent Reinforcement Learning Algorithm Augmented by Mixing Demonstrations from Centralized Policy
    Lee, Hyun-Rok
    Lee, Taesik
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1089 - 1098
  • [39] BPGAN: Brain PET synthesis from MRI using generative adversarial network for multi-modal Alzheimer's disease diagnosis
    Zhang, Jin
    He, Xiaohai
    Qing, Linbo
    Gao, Feng
    Wang, Bin
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 217
  • [40] Control framework for collaborative robot using imitation learning-based teleoperation from human digital twin to robot digital twin*
    Lee, Hyunsoo
    Kim, Seong Dae
    Amin, Mohammad Aman Ullah Al
    MECHATRONICS, 2022, 85