Anchor-Sharing and Clusterwise Contrastive Network for Multiview Representation Learning

被引:9
作者
Yan, Weiqing [1 ]
Zhang, Yuanyang [1 ]
Tang, Chang [2 ]
Zhou, Wujie [3 ,4 ]
Lin, Weisi [4 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 261400, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Representation learning; Task analysis; Correlation; Clustering methods; Self-supervised learning; Image reconstruction; Computer science; Anchor-sharing feature aggregation (ASFA); clusterwise contrastive learning (CwCL); multiview clustering (MVC); self-supervised learning;
D O I
10.1109/TNNLS.2024.3357087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiview clustering (MVC) has gained significant attention as it enables the partitioning of samples into their respective categories through unsupervised learning. However, there are a few issues as follows: 1) many existing deep clustering methods use the same latent features to achieve the conflict objectives, namely, reconstruction and view consistency. The reconstruction objective aims to preserve view-specific features for each individual view, while the view-consistency objective strives to obtain common features across all views; 2) some deep embedded clustering (DEC) approaches adopt view-wise fusion to obtain consensus feature representation. However, these approaches overlook the correlation between samples, making it challenging to derive discriminative consensus representations; and 3) many methods use contrastive learning (CL) to align the view's representations; however, they do not take into account cluster information during the construction of sample pairs, which can lead to the presence of false negative pairs. To address these issues, we propose a novel multiview representation learning network, called anchor-sharing and clusterwise CL (CwCL) network for multiview representation learning. Specifically, we separate view-specific learning and view-common learning into different network branches, which addresses the conflict between reconstruction and consistency. Second, we design an anchor-sharing feature aggregation (ASFA) module, which learns the sharing anchors from different batch data samples, establishes the bipartite relationship between anchors and samples, and further leverages it to improve the samples' representations. This module enhances the discriminative power of the common representation from different samples. Third, we design CwCL module, which incorporates the learned transition probability into CL, allowing us to focus on minimizing the similarity between representations from negative pairs with a low transition probability. It alleviates the conflict in previous sample-level contrastive alignment. Experimental results demonstrate that our method outperforms the state-of-the-art performance.
引用
收藏
页码:3797 / 3807
页数:11
相关论文
共 50 条
  • [21] Dual Contrastive Learning Network for Graph Clustering
    Peng, Xin
    Cheng, Jieren
    Tang, Xiangyan
    Liu, Jingxin
    Wu, Jiahua
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10846 - 10856
  • [22] DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning
    Wu, Xiaopeng
    Shi, Guangsi
    Zhao, Zexing
    Li, Mingjie
    Gao, Xiaojun
    Yan, Xiaoli
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [23] Semisupervised Progressive Representation Learning for Deep Multiview Clustering
    Chen, Rui
    Tang, Yongqiang
    Xie, Yuan
    Feng, Wenlong
    Zhang, Wensheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14341 - 14355
  • [24] Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding
    Bhati, Saurabhchand
    Villalba, Jesus
    Zelasko, Piotr
    Moro-Velazquez, Laureano
    Dehak, Najim
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2002 - 2014
  • [25] An Improved Inter-Intra Contrastive Learning Framework on Self-Supervised Video Representation
    Tao, Li
    Wang, Xueting
    Yamasaki, Toshihiko
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5266 - 5280
  • [26] TCLR: Temporal contrastive learning for video representation
    Dave, Ishan
    Gupta, Rohit
    Rizve, Mamshad Nayeem
    Shah, Mubarak
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219
  • [27] Self-Supervised Video Representation Learning Using Improved Instance-Wise Contrastive Learning and Deep Clustering
    Zhu, Yisheng
    Shuai, Hui
    Liu, Guangcan
    Liu, Qingshan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6741 - 6752
  • [28] Video Representation Learning with Graph Contrastive Augmentation
    Zhang, Jingran
    Xu, Xing
    Shen, Fumin
    Yao, Yazhou
    Shao, Jie
    Zhu, Xiaofeng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3043 - 3051
  • [29] Analysis of Augmentations for Contrastive ECG Representation Learning
    Soltanieh, Sahar
    Etemad, Ali
    Hashemi, Javad
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [30] Contrastive Self-Supervised Learning With Smoothed Representation for Remote Sensing
    Jung, Heechul
    Oh, Yoonju
    Jeong, Seongho
    Lee, Chaehyeon
    Jeon, Taegyun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19