Deep Partial Multi-View Learning

被引:187
作者
Zhang, Changqing [1 ]
Cui, Yajie [1 ]
Han, Zongbo [1 ]
Zhou, Joey Tianyi [2 ]
Fu, Huazhu [3 ]
Hu, Qinghua [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] ASTAR, Inst High Performance Comp IHIC, Singapore 138632, Singapore
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
Correlation; Encoding; Training; Image reconstruction; Data models; Testing; Neural networks; Multi-view learning; cross partial multi-view networks; latent representation; FRAMEWORK;
D O I
10.1109/TPAMI.2020.3037734
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although multi-view learning has made significant progress over the past few decades, it is still challenging due to the difficulty in modeling complex correlations among different views, especially under the context of view missing. To address the challenge, we propose a novel framework termed Cross Partial Multi-View Networks (CPM-Nets), which aims to fully and flexibly take advantage of multiple partial views. We first provide a formal definition of completeness and versatility for multi-view representation and then theoretically prove the versatility of the learned latent representations. For completeness, the task of learning latent multi-view representation is specifically translated to a degradation process by mimicking data transmission, such that the optimal tradeoff between consistency and complementarity across different views can be implicitly achieved. Equipped with adversarial strategy, our model stably imputes missing views, encoding information from all views for each sample to be encoded into latent representation to further enhance the completeness. Furthermore, a nonparametric classification loss is introduced to produce structured representations and prevent overfitting, which endows the algorithm with promising generalization under view-missing cases. Extensive experimental results validate the effectiveness of our algorithm over existing state of the arts for classification, representation learning and data imputation.
引用
收藏
页码:2402 / 2415
页数:14
相关论文
共 60 条
[1]  
Akaho S., 2006, ARXIV
[2]  
Andrew G, 2013, INT C MACH LEARN, V28
[3]  
[Anonymous], 2011, Advances in neural information processing systems
[4]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[5]   LSimpute: accurate estimation of missing values in microarray data with least squares methods [J].
Bo, TH ;
Dysvik, J ;
Jonassen, I .
NUCLEIC ACIDS RESEARCH, 2004, 32 (03) :e34
[6]   A SINGULAR VALUE THRESHOLDING ALGORITHM FOR MATRIX COMPLETION [J].
Cai, Jian-Feng ;
Candes, Emmanuel J. ;
Shen, Zuowei .
SIAM JOURNAL ON OPTIMIZATION, 2010, 20 (04) :1956-1982
[7]   Deep Adversarial Learning for Multi-Modality Missing Data Completion [J].
Cai, Lei ;
Wang, Zhengyang ;
Gao, Hongyang ;
Shen, Dinggang ;
Ji, Shuiwang .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1158-1166
[8]   Learning Aligned Cross-Modal Representations from Weakly Aligned Data [J].
Castrejon, Lluis ;
Aytar, Yusuf ;
Vondrick, Carl ;
Pirsiavash, Hamed ;
Torralba, Antonio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2940-2949
[9]   A unified dimensionality reduction framework for semi-paired and semi-supervised multi-view data [J].
Chen, Xiaohong ;
Chen, Songcan ;
Xue, Hui ;
Zhou, Xudong .
PATTERN RECOGNITION, 2012, 45 (05) :2005-2018
[10]  
Chung YA, 2018, ADV NEUR IN, V31