Dual Contrastive Prediction for Incomplete Multi-View Representation Learning

被引:148
作者
Lin, Yijie [1 ]
Gou, Yuanbiao [1 ]
Liu, Xiaotian [2 ]
Bai, Jinfeng [3 ]
Lv, Jiancheng [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
[3] TAL AI Lab, Beijing 100080, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Representation learning; Mutual information; Entropy; Linear programming; Optimization; Kernel; Multi-view learning; contrastive prediction; view missing; multi-view clustering; multi-view representation learning; VIEW;
D O I
10.1109/TPAMI.2022.3197238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we propose a unified framework to solve the following two challenging problems in incomplete multi-view representation learning: i) how to learn a consistent representation unifying different views, and ii) how to recover the missing views. To address the challenges, we provide an information theoretical framework under which the consistency learning and data recovery are treated as a whole. With the theoretical framework, we propose a novel objective function which jointly solves the aforementioned two problems and achieves a provable sufficient and minimal representation. In detail, the consistency learning is performed by maximizing the mutual information of different views through contrastive learning, and the missing views are recovered by minimizing the conditional entropy through dual prediction. To the best of our knowledge, this is one of the first works to theoretically unify the cross-view consistency learning and data recovery for representation learning. Extensive experimental results show that the proposed method remarkably outperforms 20 competitive multi-view learning methods on six datasets in terms of clustering, classification, and human action recognition. The code could be accessed from https://pengxi.me.
引用
收藏
页码:4447 / 4461
页数:15
相关论文
共 76 条
[1]  
Andrew G., 2013, PROC INT C MACH LEAR, V28, P1247
[2]  
[Anonymous], 2007, ICML, DOI DOI 10.1145/1273496.1273523
[3]  
[Anonymous], 2018, PROC INT C LEARN REP
[4]  
[Anonymous], 2006, 2006 IEEE COMP SOC C
[5]   Dynamic 3D Hand Gesture Recognition by Learning Weighted Depth Motion Maps [J].
Azad, Reza ;
Asadi-Aghbolaghi, Maryam ;
Kasaei, Shohreh ;
Escalera, Sergio .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (06) :1729-1740
[6]  
Chen T, 2020, PR MACH LEARN RES, V119
[7]  
Dai ZH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2978
[8]  
Deng C, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P3438
[9]  
Federici Marco, 2020, PROC INT C LEARN REP
[10]  
Fei-Fei L, 2005, PROC CVPR IEEE, P524