A Clustering-Guided Contrastive Fusion for Multi-View Representation Learning

被引:19
|
作者
Ke, Guanzhou [1 ]
Chao, Guoqing [2 ]
Wang, Xiaoli [3 ]
Xu, Chenyang [4 ]
Zhu, Yongqi [1 ]
Yu, Yang [1 ]
机构
[1] Beijing Jiaotong Univ, Inst Data Sci & Intelligent Decis Support, Beijing Inst Big Data Res, Beijing 100080, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210000, Peoples R China
[4] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529000, Peoples R China
关键词
Task analysis; Semantics; Robustness; Representation learning; Image reconstruction; Data models; Learning systems; Multi-view representation learning; contrastive learning; fusion; clustering; incomplete view; ENHANCEMENT;
D O I
10.1109/TCSVT.2023.3300319
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-view representation learning aims to extract comprehensive information from multiple sources. It has achieved significant success in applications such as video understanding and 3D rendering. However, how to improve the robustness and generalization of multi-view representations from unsupervised and incomplete scenarios remains an open question in this field. In this study, we discovered a positive correlation between the semantic distance of multi-view representations and the tolerance for data corruption. Moreover, we found that the information ratio of consistency and complementarity significantly impacts the performance of discriminative and generative tasks related to multi-view representations. Based on these observations, we propose an end-to-end CLustering-guided cOntrastiVE fusioN (CLOVEN) method, which enhances the robustness and generalization of multi-view representations simultaneously. To balance consistency and complementarity, we design an asymmetric contrastive fusion module. The module first combines all view-specific representations into a comprehensive representation through a scaling fusion layer. Then, the information of the comprehensive representation and view-specific representations is aligned via contrastive learning loss function, resulting in a view-common representation that includes both consistent and complementary information. We prevent the module from learning suboptimal solutions by not allowing information alignment between view-specific representations. We design a clustering-guided module that encourages the aggregation of semantically similar views. This action reduces the semantic distance of the view-common representation. We quantitatively and qualitatively evaluate CLOVEN on five datasets, demonstrating its superiority over 13 other competitive multi-view learning methods in terms of clustering and classification performance. In the data-corrupted scenario, our proposed method resists noise interference better than competitors. Additionally, the visualization demonstrates that CLOVEN succeeds in preserving the intrinsic structure of view-specific representations and improves the compactness of view-common representations. Our code can be found at https://github.com/guanzhou-ke/cloven.
引用
收藏
页码:2056 / 2069
页数:14
相关论文
共 50 条
  • [31] Dual Contrastive Prediction for Incomplete Multi-View Representation Learning
    Lin, Yijie
    Gou, Yuanbiao
    Liu, Xiaotian
    Bai, Jinfeng
    Lv, Jiancheng
    Peng, Xi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4447 - 4461
  • [32] Self-Weighted Contrastive Fusion for Deep Multi-View Clustering
    Wu, Song
    Zheng, Yan
    Ren, Yazhou
    He, Jing
    Pu, Xiaorong
    Huang, Shudong
    Hao, Zhifeng
    He, Lifang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9150 - 9162
  • [33] Multi-view graph contrastive representation learning for bundle recommendation
    Zhang, Peng
    Niu, Zhendong
    Ma, Ru
    Zhang, Fuzhi
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [34] MORI-RAN: Multi-view Robust Representation Learning via Hybrid Contrastive Fusion
    Ke, Guanzhou
    Zhu, Yongqi
    Yu, Yang
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 467 - 474
  • [35] Learning Smooth Representation for Multi-view Subspace Clustering
    Huang, Shudong
    Liu, Yixi
    Ren, Yazhou
    Tsang, Ivor W.
    Xu, Zenglin
    Lv, Jiancheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3421 - 3429
  • [36] Joint representation learning for multi-view subspace clustering
    Zhang, Guang-Yu
    Zhou, Yu-Ren
    Wang, Chang-Dong
    Huang, Dong
    He, Xiao-Yu
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 166
  • [37] Representation Learning in Multi-view Clustering: A Literature Review
    Chen, Man-Sheng
    Lin, Jia-Qi
    Li, Xiang-Long
    Liu, Bao-Yu
    Wang, Chang-Dong
    Huang, Dong
    Lai, Jian-Huang
    DATA SCIENCE AND ENGINEERING, 2022, 7 (03) : 225 - 241
  • [38] Representation Learning in Multi-view Clustering: A Literature Review
    Man-Sheng Chen
    Jia-Qi Lin
    Xiang-Long Li
    Bao-Yu Liu
    Chang-Dong Wang
    Dong Huang
    Jian-Huang Lai
    Data Science and Engineering, 2022, 7 : 225 - 241
  • [39] Flexible Multi-View Representation Learning for Subspace Clustering
    Li, Ruihuang
    Zhang, Changqing
    Hu, Qinghua
    Zhu, Pengfei
    Wang, Zheng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2916 - 2922
  • [40] Trustworthy multi-view clustering via alternating generative adversarial representation learning and fusion
    Yang, Wenqi
    Wang, Minhui
    Tang, Chang
    Zheng, Xiao
    Liu, Xinwang
    He, Kunlun
    INFORMATION FUSION, 2024, 107