A Clustering-Guided Contrastive Fusion for Multi-View Representation Learning

被引:19
|
作者
Ke, Guanzhou [1 ]
Chao, Guoqing [2 ]
Wang, Xiaoli [3 ]
Xu, Chenyang [4 ]
Zhu, Yongqi [1 ]
Yu, Yang [1 ]
机构
[1] Beijing Jiaotong Univ, Inst Data Sci & Intelligent Decis Support, Beijing Inst Big Data Res, Beijing 100080, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210000, Peoples R China
[4] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529000, Peoples R China
关键词
Task analysis; Semantics; Robustness; Representation learning; Image reconstruction; Data models; Learning systems; Multi-view representation learning; contrastive learning; fusion; clustering; incomplete view; ENHANCEMENT;
D O I
10.1109/TCSVT.2023.3300319
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-view representation learning aims to extract comprehensive information from multiple sources. It has achieved significant success in applications such as video understanding and 3D rendering. However, how to improve the robustness and generalization of multi-view representations from unsupervised and incomplete scenarios remains an open question in this field. In this study, we discovered a positive correlation between the semantic distance of multi-view representations and the tolerance for data corruption. Moreover, we found that the information ratio of consistency and complementarity significantly impacts the performance of discriminative and generative tasks related to multi-view representations. Based on these observations, we propose an end-to-end CLustering-guided cOntrastiVE fusioN (CLOVEN) method, which enhances the robustness and generalization of multi-view representations simultaneously. To balance consistency and complementarity, we design an asymmetric contrastive fusion module. The module first combines all view-specific representations into a comprehensive representation through a scaling fusion layer. Then, the information of the comprehensive representation and view-specific representations is aligned via contrastive learning loss function, resulting in a view-common representation that includes both consistent and complementary information. We prevent the module from learning suboptimal solutions by not allowing information alignment between view-specific representations. We design a clustering-guided module that encourages the aggregation of semantically similar views. This action reduces the semantic distance of the view-common representation. We quantitatively and qualitatively evaluate CLOVEN on five datasets, demonstrating its superiority over 13 other competitive multi-view learning methods in terms of clustering and classification performance. In the data-corrupted scenario, our proposed method resists noise interference better than competitors. Additionally, the visualization demonstrates that CLOVEN succeeds in preserving the intrinsic structure of view-specific representations and improves the compactness of view-common representations. Our code can be found at https://github.com/guanzhou-ke/cloven.
引用
收藏
页码:2056 / 2069
页数:14
相关论文
共 50 条
  • [41] Decoupled representation for multi-view learning
    Sun, Shiding
    Wang, Bo
    Tian, Yingjie
    PATTERN RECOGNITION, 2024, 151
  • [42] AdaMCL: Adaptive Fusion Multi-View Contrastive Learning for Collaborative Filtering
    Zhu, Guanghui
    Lu, Wang
    Yuan, Chunfeng
    Huang, Yihua
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1076 - 1085
  • [43] Contrastive Multi-View Kernel Learning
    Liu, Jiyuan
    Liu, Xinwang
    Yang, Yuexiang
    Liao, Qing
    Xia, Yuanqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9552 - 9566
  • [44] View-Driven Multi-View Clustering via Contrastive Double-Learning
    Liu, Shengcheng
    Zhu, Changming
    Li, Zishi
    Yang, Zhiyuan
    Gu, Wenjie
    ENTROPY, 2024, 26 (06)
  • [45] A Survey of Multi-View Representation Learning
    Li, Yingming
    Yang, Ming
    Zhang, Zhongfei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (10) : 1863 - 1883
  • [46] Multi-view representation learning for data stream clustering q
    Chen, Jie
    Yang, Shengxiang
    Wang, Zhu
    INFORMATION SCIENCES, 2022, 613 : 731 - 746
  • [47] Multi-view clustering via efficient representation learning with anchors
    Yu, Xiao
    Liu, Hui
    Zhang, Yan
    Sun, Shanbao
    Zhang, Caiming
    PATTERN RECOGNITION, 2023, 144
  • [48] Multi-view Contrastive Clustering with Clustering Guidance and Adaptive Auto-encoders
    Guo, Bingchen
    Kong, Bing
    Zhou, Lihua
    Chen, Hongmei
    Bao, Chongming
    SPATIAL DATA AND INTELLIGENCE, SPATIALDI 2024, 2024, 14619 : 3 - 14
  • [49] Prototype Matching Learning for Incomplete Multi-View Clustering
    Yuan, Honglin
    Sun, Yuan
    Zhou, Fei
    Wen, Jing
    Yuan, Shihua
    You, Xiaojian
    Ren, Zhenwen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 828 - 841
  • [50] Contrastive learning, multi-view redundancy, and linear models
    Tosh, Christopher
    Krishnamurthy, Akshay
    Hsu, Daniel
    ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132