An imitation learning framework for generating multi-modal trajectories from unstructured demonstrations

被引：3

作者：

Peng, Jian-Wei ^{[1
]}

Hu, Min-Chun ^{[2
]}

Chu, Wei-Ta ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan

[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan

来源：

NEUROCOMPUTING | 2022年 / 500卷

关键词：

Trajectory generation; Motion synthesis; Imitation learning; Reinforcement learning; Generative adversarial networks; HUMAN MOTION PREDICTION;

D O I：

10.1016/j.neucom.2022.05.076

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The main challenge of the trajectory generation problem is to generate long-term as well as diverse tra-jectories. Generative Adversarial Imitation Learning (GAIL) is a well-known model-free imitation learning algorithm that can be utilized to generate trajectory data, while vanilla GAIL would fail to capture multi -modal demonstrations. Recent methods propose latent variable models to solve this problem; however, previous works may have a mode missing problem. In this work, we propose a novel method to generate long-term trajectories that are controllable by a continuous latent variable based on GAIL and a condi-tional Variational Autoencoder (cVAE). We further assume that subsequences of the same trajectory should be encoded to similar locations in the latent space. Therefore, we introduce a contrastive loss in the training of the encoder. In our motion synthesis task, we propose to first construct a low-dimensional motion manifold by using a VAE to reduce the burden of our imitation learning model. Our experimental results show that the proposed model outperforms the state-of-the-art methods and can be applied to motion synthesis.(c) 2022 Elsevier B.V. All rights reserved.

引用

页码：712 / 723

页数：12

共 41 条

[31] A reinforcement learning-based neighborhood search operator for multi-modal optimization and its applications
Hong, Jiale
Shen, Bo
Pan, Anqi
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
[32] Learning Category-Level Generalizable Object Manipulation Policy Via Generative Adversarial Self-Imitation Learning From Demonstrations
Shen, Hao
Wan, Weikang
Wang, He
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11166 - 11173
[33] Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network
Chen, Xingyu
Li, Jin
Lan, Xuguang
Zheng, Nanning
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 177 - 187
[34] A whole-process interpretable and multi-modal deep reinforcement learning for diagnosis and analysis of Alzheimer's disease *
Zhang, Quan
Du, Qian
Liu, Guohua
JOURNAL OF NEURAL ENGINEERING, 2021, 18 (06)
[35] Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations
Zhang, Hanxuan
Huo, Ju
Huang, Yulong
Cheng, Jiajun
Li, Xiaofeng
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03): : 3248 - 3260
[36] AW3M: An auto-weighting and recovery framework for breast cancer diagnosis using multi-modal ultrasound
Huang, Ruobing
Lin, Zehui
Dou, Haoran
Wang, Jian
Miao, Juzheng
Zhou, Guangquan
Jia, Xiaohong
Xu, Wenwen
Mei, Zihan
Dong, Yijie
Yang, Xin
Zhou, Jianqiao
Ni, Dong
MEDICAL IMAGE ANALYSIS, 2021, 72
[37] Alternative multi-label imitation learning framework monitoring tool wear and bearing fault under different working conditions
Wang, Zisheng
Xuan, Jianping
Shi, Tielin
ADVANCED ENGINEERING INFORMATICS, 2022, 54
[38] Improved Cooperative Multi-agent Reinforcement Learning Algorithm Augmented by Mixing Demonstrations from Centralized Policy
Lee, Hyun-Rok
Lee, Taesik
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1089 - 1098
[39] BPGAN: Brain PET synthesis from MRI using generative adversarial network for multi-modal Alzheimer's disease diagnosis
Zhang, Jin
He, Xiaohai
Qing, Linbo
Gao, Feng
Wang, Bin
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 217
[40] Control framework for collaborative robot using imitation learning-based teleoperation from human digital twin to robot digital twin*
Lee, Hyunsoo
Kim, Seong Dae
Amin, Mohammad Aman Ullah Al
MECHATRONICS, 2022, 85

← 1 2 3 4 5 →