Multi-view dreaming: multi-view world model with contrastive learning

被引:0
|
作者
Kinose A. [1 ]
Okumura R. [2 ]
Okada M. [2 ]
Taniguchi T. [2 ,3 ]
机构
[1] Research and Development Center, Panasonic Connect Co. Ltd., Tokyo
[2] Digital and AI Technology Center, Technology Division, Panasonic Holdings Co, Osaka, Kadoma
[3] College of Information Science and Engineering, Ritsumeikan University, Kusatsu
关键词
multimodal; reinforcement learning; robotic manipulation; sensor integration; World models;
D O I
10.1080/01691864.2023.2264363
中图分类号
学科分类号
摘要
In this paper, we propose Multi-View Dreaming, a novel reinforcement learning agent for integrated recognition and control from multi-view observations by extending Dreaming. Most current reinforcement learning method assumes a single-view observation space, and this imposes limitations on the observed data, such as lack of spatial information and occlusions. This makes obtaining ideal observational information from the environment difficult and is a bottleneck for real-world robotics applications. In this paper, we use contrastive learning to train a shared latent space between different viewpoints and show how the Products of Experts approach can be used to integrate and control the probability distributions of latent states for multiple viewpoints. We also propose Multi-View DreamingV2, a variant of Multi-View Dreaming that uses a categorical distribution to model the latent state instead of the Gaussian distribution. Experiments show that the proposed method outperforms simple extensions of existing methods in a realistic robot control task. © 2023 Informa UK Limited, trading as Taylor & Francis Group and The Robotics Society of Japan.
引用
收藏
页码:1212 / 1220
页数:8
相关论文
共 50 条
  • [1] Contrastive Multi-View Kernel Learning
    Liu, Jiyuan
    Liu, Xinwang
    Yang, Yuexiang
    Liao, Qing
    Xia, Yuanqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9552 - 9566
  • [2] Multi-View Contrastive Learning from Demonstrations
    Correia, Andre
    Alexandre, Luis A.
    2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC, 2022, : 338 - 344
  • [3] Dual contrastive learning for multi-view clustering
    Bao, Yichen
    Zhao, Wenhui
    Zhao, Qin
    Gao, Quanxue
    Yang, Ming
    NEUROCOMPUTING, 2024, 599
  • [4] Multi-view Contrastive Learning Network for Recommendation
    Bu, Xiya
    Ma, Ruixin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 319 - 330
  • [5] Heterogeneous Graph Contrastive Multi-view Learning
    Wang, Zehong
    Li, Qi
    Yu, Donghua
    Han, Xiaolong
    Gao, Xiao-Zhi
    Shen, Shigen
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 136 - 144
  • [6] Contrastive Multi-View Representation Learning on Graphs
    Hassani, Kaveh
    Khasahmadi, Amir Hosein
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [7] Multi-view Multi-behavior Contrastive Learning in Recommendation
    Wu, Yiqing
    Xie, Ruobing
    Zhu, Yongchun
    Ao, Xiang
    Chen, Xin
    Zhang, Xu
    Zhuang, Fuzhen
    Lin, Leyu
    He, Qing
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 166 - 182
  • [8] Multi-view representation learning for multi-view action recognition
    Hao, Tong
    Wu, Dan
    Wang, Qian
    Sun, Jin-Sheng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
  • [9] MULTI-VIEW METRIC LEARNING FOR MULTI-VIEW VIDEO SUMMARIZATION
    Wang, Linbo
    Fang, Xianyong
    Guo, Yanwen
    Fu, Yanwei
    2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 179 - 182
  • [10] Contrastive and attentive graph learning for multi-view clustering
    Wang, Ru
    Li, Lin
    Tao, Xiaohui
    Wang, Peipei
    Liu, Peiyu
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)