Multi-view dreaming: multi-view world model with contrastive learning

Cited by: 0

Authors:

Kinose A. [1]

Okumura R. [2]

Okada M. [2]

Taniguchi T. [2,3]

Affiliations:

[1] Research and Development Center, Panasonic Connect Co. Ltd., Tokyo

[2] Digital and AI Technology Center, Technology Division, Panasonic Holdings Co, Osaka, Kadoma

[3] College of Information Science and Engineering, Ritsumeikan University, Kusatsu

Source:

Advanced Robotics | 2023 / Vol. 37 / No. 19

Keywords:

multimodal; reinforcement learning; robotic manipulation; sensor integration; world models

DOI:

10.1080/01691864.2023.2264363


Abstract:

In this paper, we propose Multi-View Dreaming, a novel reinforcement learning agent that extends Dreaming to perform integrated recognition and control from multi-view observations. Most current reinforcement learning methods assume a single-view observation space, which imposes limitations on the observed data, such as a lack of spatial information and occlusions. These limitations make it difficult to obtain ideal observations from the environment and are a bottleneck for real-world robotics applications. We use contrastive learning to train a latent space shared between different viewpoints, and show how the Product of Experts approach can be used to integrate and control the probability distributions of latent states for multiple viewpoints. We also propose Multi-View DreamingV2, a variant of Multi-View Dreaming that models the latent state with a categorical distribution instead of a Gaussian distribution. Experiments show that the proposed method outperforms simple extensions of existing methods in a realistic robot control task. © 2023 Informa UK Limited, trading as Taylor & Francis Group and The Robotics Society of Japan.
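The abstract mentions combining per-viewpoint latent distributions with a Product of Experts. As a rough illustration only, and not the authors' implementation, the sketch below shows the standard closed form for a product of one-dimensional Gaussian experts: precisions (inverse variances) add, and the fused mean is the precision-weighted average of the expert means. The function name `product_of_gaussian_experts` and the scalar setting are assumptions made for this example; a diagonal Gaussian latent state would apply the same rule per dimension.

```python
from typing import Sequence, Tuple


def product_of_gaussian_experts(
    means: Sequence[float], variances: Sequence[float]
) -> Tuple[float, float]:
    """Fuse 1-D Gaussian experts N(mean_i, var_i) into a single Gaussian.

    Hypothetical helper for illustration; it is not taken from the paper.
    """
    if len(means) == 0 or len(means) != len(variances):
        raise ValueError("means and variances must be non-empty and equal in length")
    if any(v <= 0.0 for v in variances):
        raise ValueError("variances must be strictly positive")

    # Precision of each expert is the inverse of its variance.
    precisions = [1.0 / v for v in variances]

    # The product of Gaussians is Gaussian: precisions add up.
    fused_variance = 1.0 / sum(precisions)

    # The fused mean is the precision-weighted average of the expert means.
    fused_mean = fused_variance * sum(m * p for m, p in zip(means, precisions))
    return fused_mean, fused_variance


if __name__ == "__main__":
    # Two viewpoints: the first is more certain, so it pulls the mean toward 0.
    mean, variance = product_of_gaussian_experts([0.0, 4.0], [1.0, 3.0])
    print(mean, variance)
```

Under this rule a viewpoint with high variance, for example an occluded camera, contributes little to the fused estimate, which is one common motivation for this kind of fusion.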


Pages: 1212-1220

Page count: 8
