Deep multi-view learning methods: A review

被引:167
作者
Yan, Xiaoqiang [1 ]
Hu, Shizhe [1 ]
Mao, Yiqiao [1 ]
Ye, Yangdong [1 ]
Yu, Hui [2 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450052, Peoples R China
[2] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, Hants, England
关键词
Deep multi-view learning; deep neural networks; representation learning; statistical learning survey; CANONICAL CORRELATION-ANALYSIS; GRAPH NEURAL-NETWORK; INFORMATION BOTTLENECK; ACTION RECOGNITION; VIEW; ENSEMBLE; AUTOENCODER; REDUCTION; MODELS;
D O I
10.1016/j.neucom.2021.03.090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view learning (MVL) has attracted increasing attention and achieved great practical success by exploiting complementary information of multiple features or modalities. Recently, due to the remarkable performance of deep models, deep MVL has been adopted in many domains, such as machine learning, artificial intelligence and computer vision. This paper presents a comprehensive review on deep MVL from the following two perspectives: MVL methods in deep learning scope and deep MVL extensions of traditional methods. Specifically, we first review the representative MVL methods in the scope of deep learning, such as multi-view auto-encoder, conventional neural networks and deep brief networks. Then, we investigate the advancements of the MVL mechanism when traditional learning methods meet deep learning models, such as deep multi-view canonical correlation analysis, matrix factorization and information bottleneck. Moreover, we also summarize the main applications, widely-used datasets and performance comparison in the domain of deep MVL. Finally, we attempt to identify some open challenges to inform future research directions. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:106 / 129
页数:24
相关论文
共 210 条
  • [1] Multimodal Recurrent Neural Networks With Information Transfer Layers for Indoor Scene Labeling
    Abdulnabi, Abrar H.
    Shuai, Bing
    Zuo, Zhen
    Chau, Lap-Pui
    Wang, Gang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (07) : 1656 - 1671
  • [2] Spectral clustering via ensemble deep autoencoder learning (SC-EDAE)
    Affeldt, Severine
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. PATTERN RECOGNITION, 2020, 108
  • [3] Ahmadi A.H.K., 2020, INT C MACHINE LEARNI, P4116
  • [4] Akaho S, 2001, INT M PSYCHOMETRIC S
  • [5] Akata Z, 2020, ARXIV200207017
  • [6] A multimodal deep learning framework using local feature representations for face recognition
    Al-Waisy, Alaa S.
    Qahwaji, Rami
    Ipson, Stanley
    Al-Fahdawi, Shumoos
    [J]. MACHINE VISION AND APPLICATIONS, 2018, 29 (01) : 35 - 54
  • [7] Classifying Imbalanced Multi-modal Sensor Data for Human Activity Recognition in a Smart Home using Deep Learning
    Alani, Ali A.
    Cosma, Georgina
    Taherkhani, Aboozar
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [8] Alemi AA, 2017, ICLR
  • [9] Alemi AA, 2019, VARIATIONAL PREDICTI, P1
  • [10] Deep Multimodal Fusion: A Hybrid Approach
    Amer, Mohamed R.
    Shields, Timothy
    Siddiquie, Behjat
    Tamrakar, Amir
    Divakaran, Ajay
    Chai, Sek
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 440 - 456