Anchor-Sharing and Clusterwise Contrastive Network for Multiview Representation Learning

被引：15

作者：

Yan, Weiqing ^{[1
]}

Zhang, Yuanyang ^{[1
]}

Tang, Chang ^{[2
]}

Zhou, Wujie ^{[3
,4
]}

Lin, Weisi ^{[4
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai 261400, Peoples R China

[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China

[3] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China

[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年

基金：

中国国家自然科学基金;

关键词：

Representation learning; Task analysis; Correlation; Clustering methods; Self-supervised learning; Image reconstruction; Computer science; Anchor-sharing feature aggregation (ASFA); clusterwise contrastive learning (CwCL); multiview clustering (MVC); self-supervised learning;

D O I：

10.1109/TNNLS.2024.3357087

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multiview clustering (MVC) has gained significant attention as it enables the partitioning of samples into their respective categories through unsupervised learning. However, there are a few issues as follows: 1) many existing deep clustering methods use the same latent features to achieve the conflict objectives, namely, reconstruction and view consistency. The reconstruction objective aims to preserve view-specific features for each individual view, while the view-consistency objective strives to obtain common features across all views; 2) some deep embedded clustering (DEC) approaches adopt view-wise fusion to obtain consensus feature representation. However, these approaches overlook the correlation between samples, making it challenging to derive discriminative consensus representations; and 3) many methods use contrastive learning (CL) to align the view's representations; however, they do not take into account cluster information during the construction of sample pairs, which can lead to the presence of false negative pairs. To address these issues, we propose a novel multiview representation learning network, called anchor-sharing and clusterwise CL (CwCL) network for multiview representation learning. Specifically, we separate view-specific learning and view-common learning into different network branches, which addresses the conflict between reconstruction and consistency. Second, we design an anchor-sharing feature aggregation (ASFA) module, which learns the sharing anchors from different batch data samples, establishes the bipartite relationship between anchors and samples, and further leverages it to improve the samples' representations. This module enhances the discriminative power of the common representation from different samples. Third, we design CwCL module, which incorporates the learned transition probability into CL, allowing us to focus on minimizing the similarity between representations from negative pairs with a low transition probability. It alleviates the conflict in previous sample-level contrastive alignment. Experimental results demonstrate that our method outperforms the state-of-the-art performance.

引用

页码：3797 / 3807

页数：11

共 50 条

[21] Joint Representation Learning and Clustering: A Framework for Grouping Partial Multiview Data [J].

Zhuge, Wenzhang ;

Tao, Hong ;

Luo, Tingjin ;

Zeng, Ling-Li ;

Hou, Chenping ;

Yi, Dongyun .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) :3826-3840

[22] Dual Contrastive Learning Network for Graph Clustering [J].

Peng, Xin ;

Cheng, Jieren ;

Tang, Xiangyan ;

Liu, Jingxin ;

Wu, Jiahua .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) :10846-10856

[23] DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning [J].

Wu, Xiaopeng ;

Shi, Guangsi ;

Zhao, Zexing ;

Li, Mingjie ;

Gao, Xiaojun ;

Yan, Xiaoli .

EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249

[24] Tensor Learning Meets Dynamic Anchor Learning: From Complete to Incomplete Multiview Clustering [J].

Chen, Yongyong ;

Zhao, Xiaojia ;

Zhang, Zheng ;

Liu, Youfa ;

Su, Jingyong ;

Zhou, Yicong .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) :15332-15345

[25] Multigranularity Information Fused Contrastive Learning With Multiview Clustering [J].

Ju, Hengrong ;

Lu, Yang ;

Ding, Weiping ;

Zhang, Wei ;

Yang, Xibei .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,

[26] Semisupervised Progressive Representation Learning for Deep Multiview Clustering [J].

Chen, Rui ;

Tang, Yongqiang ;

Xie, Yuan ;

Feng, Wenlong ;

Zhang, Wensheng .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) :14341-14355

[27] Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding [J].

Bhati, Saurabhchand ;

Villalba, Jesus ;

Zelasko, Piotr ;

Moro-Velazquez, Laureano ;

Dehak, Najim .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 :2002-2014

[28] An Improved Inter-Intra Contrastive Learning Framework on Self-Supervised Video Representation [J].

Tao, Li ;

Wang, Xueting ;

Yamasaki, Toshihiko .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) :5266-5280

[29] Self-Supervised Video Representation Learning Using Improved Instance-Wise Contrastive Learning and Deep Clustering [J].

Zhu, Yisheng ;

Shuai, Hui ;

Liu, Guangcan ;

Liu, Qingshan .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) :6741-6752

[30] TCLR: Temporal contrastive learning for video representation [J].

Dave, Ishan ;

Gupta, Rohit ;

Rizve, Mamshad Nayeem ;

Shah, Mubarak .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219

← 1 2 3 4 5 →