Dual structural consistency based multi-modal correlation propagation projections for data representation

Cited by: 0
Authors
Ji, Hong-Kun [1, 2]
Sun, Quan-Sen [1]
Yuan, Yun-Hao [3]
Ji, Ze-Xuan [1]
Zhang, Guo-Qing [1]
Feng, Lei [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 637553, Singapore
[3] Yangzhou Univ, Dept Comp Sci & Technol, Yangzhou 225000, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Multi-modal semi-supervised learning; Structural consistency; Feature extraction and fusion; Correlation analysis; CANONICAL CORRELATION-ANALYSIS; FACE RECOGNITION; DIMENSIONALITY REDUCTION; FEATURE-EXTRACTION; CLASSIFICATION; ALGORITHM; FUSION; FORMULATION; EXTENSIONS; EIGENFACES;
DOI
10.1007/s11042-016-3993-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Canonical correlation analysis (CCA) is a powerful tool for analyzing multi-dimensional paired data. However, when facing semi-supervised multi-modal data, which widely exist in real-world applications, CCA usually performs poorly because it ignores useful supervised information. (Such data are also called multi-view data (Hou et al., Pattern Recog 43(3):720-730, 2010) or multi-represented data (Kailing et al., Clustering multi-represented objects with noise. In: Proceedings of the eighth Pacific-Asia conference on knowledge discovery and data mining (PAKDD). Sydney, Australia, pp 394-403); for convenience, we uniformly call them multi-modal data hereafter.) Meanwhile, because labeled training samples are limited in the semi-supervised scenario, supervised extensions of CCA suffer from overfitting. Several semi-supervised extensions of CCA have been proposed recently. Nevertheless, they either utilize only the global structural information captured from the unlabeled data, or propagate label information by discovering, in advance, the affinities between the labeled and unlabeled data points only. In this paper, we propose a robust multi-modal semi-supervised feature extraction and fusion framework, termed dual structural consistency based multi-modal correlation propagation projections (SCMCPP). SCMCPP guarantees the consistency between the representation structure and the hypotaxis structure in each modality, and ensures the consistency of the hypotaxis structure between two different modalities. By iteratively propagating labels and learning affinities, the discriminative information of both the given labels and the estimated labels is utilized to improve the affinity construction and to infer the remaining unknown labels. Moreover, probabilistic within-class scatter matrices in each modality and a probabilistic correlation matrix between the two modalities are constructed to enhance the discriminative power of the features. Extensive experiments on several benchmark face databases demonstrate the effectiveness of our approach.
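For orientation, the baseline the abstract starts from (plain, unsupervised CCA on two modalities) can be sketched as follows. This is not the SCMCPP method and does not use labels. It is a minimal illustration assuming NumPy, and the function name `cca`, the small ridge term `reg`, and the toy data are all assumptions introduced here, not taken from the paper.

```python
import numpy as np


def cca(X, Y, n_components=1, reg=1e-6):
    """Plain CCA via whitening and SVD (illustrative sketch, not SCMCPP).

    X: (n, p) and Y: (n, q) hold paired samples, one row per sample.
    reg is a small ridge term (an assumption) that keeps the
    covariance matrices invertible.
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = X.shape[0]

    # Within-modality covariances and between-modality cross-covariance.
    Sxx = X.T @ X / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Y.T @ Y / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = X.T @ Y / (n - 1)

    def inv_sqrt(S):
        # Inverse square root of a symmetric positive definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    Kx, Ky = inv_sqrt(Sxx), inv_sqrt(Syy)

    # Singular values of the whitened cross-covariance are the
    # canonical correlations; singular vectors give the projections.
    U, s, Vt = np.linalg.svd(Kx @ Sxy @ Ky)
    Wx = Kx @ U[:, :n_components]
    Wy = Ky @ Vt.T[:, :n_components]
    return Wx, Wy, s[:n_components]


# Toy paired data: both modalities share one latent signal z.
rng = np.random.default_rng(0)
z = rng.standard_normal(500)
X = np.column_stack([z + 0.1 * rng.standard_normal(500),
                     rng.standard_normal(500)])
Y = np.column_stack([rng.standard_normal(500),
                     z + 0.1 * rng.standard_normal(500)])

Wx, Wy, rho = cca(X, Y, n_components=1)
print("first canonical correlation:", rho[0])
```

Because this baseline sees only the paired features, any class labels go unused, which is the limitation the abstract attributes to CCA in the semi-supervised setting.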
Pages: 20909-20933
Number of pages: 25