Dual structural consistency based multi-modal correlation propagation projections for data representation

Cited by: 0
Authors
Ji, Hong-Kun [1, 2]
Sun, Quan-Sen [1 ]
Yuan, Yun-Hao [3 ]
Ji, Ze-Xuan [1 ]
Zhang, Guo-Qing [1 ]
Feng, Lei [1 ]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 637553, Singapore
[3] Yangzhou Univ, Dept Comp Sci & Technol, Yangzhou 225000, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Multi-modal semi-supervised learning; Structural consistency; Feature extraction and fusion; Correlation analysis; CANONICAL CORRELATION-ANALYSIS; FACE RECOGNITION; DIMENSIONALITY REDUCTION; FEATURE-EXTRACTION; CLASSIFICATION; ALGORITHM; FUSION; FORMULATION; EXTENSIONS; EIGENFACES;
DOI
10.1007/s11042-016-3993-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Canonical correlation analysis (CCA) is a powerful tool for analyzing multi-dimensional paired data. However, when facing semi-supervised multi-modal data, which widely exist in real-world applications, CCA usually performs poorly because it ignores useful supervised information. (Such data are also called multi-view data (Hou et al., Pattern Recog 43(3):720-730, 2010) or multi-represented data (Kailing et al., Clustering multi-represented objects with noise. In: Proceedings of the eighth Pacific-Asia conference on knowledge discovery and data mining (PAKDD). Sydney, Australia, pp 394-403); for convenience, we uniformly call them multi-modal data hereafter.) Meanwhile, because labeled training samples are limited in the semi-supervised scenario, supervised extensions of CCA suffer from overfitting. Several semi-supervised extensions of CCA have been proposed recently. Nevertheless, they either use only the global structural information captured from the unlabeled data, or propagate label information by discovering, in advance, the affinities between only the labeled and unlabeled data points. In this paper, we propose a robust multi-modal semi-supervised feature extraction and fusion framework, termed dual structural consistency based multi-modal correlation propagation projections (SCMCPP). SCMCPP guarantees the consistency between the representation structure and the hypotaxis structure in each modality, and ensures the consistency of the hypotaxis structure between two different modalities. By iteratively propagating labels and learning affinities, the discriminative information of both the given labels and the estimated labels is used to improve the affinity construction and to infer the remaining unknown labels. Moreover, probabilistic within-class scatter matrices in each modality and a probabilistic correlation matrix between the two modalities are constructed to enhance the discriminative power of the features. Extensive experiments on several benchmark face databases demonstrate the effectiveness of our approach.
Pages: 20909-20933
Number of pages: 25