Multi-view dreaming: multi-view world model with contrastive learning

被引:0
|
作者
Kinose A. [1 ]
Okumura R. [2 ]
Okada M. [2 ]
Taniguchi T. [2 ,3 ]
机构
[1] Research and Development Center, Panasonic Connect Co. Ltd., Tokyo
[2] Digital and AI Technology Center, Technology Division, Panasonic Holdings Co, Osaka, Kadoma
[3] College of Information Science and Engineering, Ritsumeikan University, Kusatsu
关键词
multimodal; reinforcement learning; robotic manipulation; sensor integration; World models;
D O I
10.1080/01691864.2023.2264363
中图分类号
学科分类号
摘要
In this paper, we propose Multi-View Dreaming, a novel reinforcement learning agent for integrated recognition and control from multi-view observations by extending Dreaming. Most current reinforcement learning method assumes a single-view observation space, and this imposes limitations on the observed data, such as lack of spatial information and occlusions. This makes obtaining ideal observational information from the environment difficult and is a bottleneck for real-world robotics applications. In this paper, we use contrastive learning to train a shared latent space between different viewpoints and show how the Products of Experts approach can be used to integrate and control the probability distributions of latent states for multiple viewpoints. We also propose Multi-View DreamingV2, a variant of Multi-View Dreaming that uses a categorical distribution to model the latent state instead of the Gaussian distribution. Experiments show that the proposed method outperforms simple extensions of existing methods in a realistic robot control task. © 2023 Informa UK Limited, trading as Taylor & Francis Group and The Robotics Society of Japan.
引用
收藏
页码:1212 / 1220
页数:8
相关论文
共 50 条
  • [21] Multi-view Mixed Attention for Contrastive Learning on Hypergraphs
    Lee, Jongsoo
    Chae, Dong-Kyu
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2543 - 2547
  • [22] Selective Contrastive Learning for Unpaired Multi-View Clustering
    Xin, Like
    Yang, Wanqi
    Wang, Lei
    Yang, Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1749 - 1763
  • [23] Multi-view Document Clustering with Joint Contrastive Learning
    Bai, Ruina
    Huang, Ruizhang
    Qin, Yongbin
    Chen, Yanping
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 706 - 719
  • [24] Contrastive Consensus Graph Learning for Multi-View Clustering
    Shiping Wang
    Xincan Lin
    Zihan Fang
    Shide Du
    Guobao Xiao
    IEEE/CAAJournalofAutomaticaSinica, 2022, 9 (11) : 2027 - 2030
  • [25] Multi-view clustering with semantic fusion and contrastive learning
    Yu, Hui
    Bian, Hui-Xiang
    Chong, Zi-Ling
    Liu, Zun
    Shi, Jian-Yu
    NEUROCOMPUTING, 2024, 603
  • [26] MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation
    Ma, Yunshan
    He, Yingzhi
    Wang, Xiang
    Wei, Yinwei
    Du, Xiaoyu
    Fu, Yuyangzi
    Chua, Tat-Seng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (04)
  • [27] Selective Contrastive Learning for Unpaired Multi-View Clustering
    Xin, Like
    Yang, Wanqi
    Wang, Lei
    Yang, Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1749 - 1763
  • [28] Multi-view Contrastive Learning for Medical Question Summarization
    Wei, Sibo
    Peng, Xueping
    Guan, Hongjiao
    Geng, Lina
    Jian, Ping
    Wu, Hao
    Lu, Wenpeng
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1826 - 1831
  • [29] Multi-view denoising contrastive learning for bundle recommendation
    Sang, Lei
    Hu, Yang
    Zhang, Yi
    Zhang, Yiwen
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12332 - 12346
  • [30] Multi-view Contrastive Graph Clustering
    Pan, Erlin
    Kang, Zhao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34