Multiview nonlinear discriminant structure learning for emotion recognition

被引:1
作者
Guo, Shuai [1 ]
Song, Li [2 ,3 ]
Xie, Rong [1 ]
Li, Lin [4 ]
Liu, Shenglan [5 ,6 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[4] MIGU Co Ltd, Beijing 100120, Peoples R China
[5] Dalian Univ Technol, Sch Innovat & Entrepreneurship, Dalian 116024, Peoples R China
[6] Dalian Univ Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
关键词
Multiview subspace learning; Emotion recognition; Nonlinear; Uncorrelated; Out-of-sample; CANONICAL CORRELATION-ANALYSIS; LAPLACIAN EIGENMAPS; MODELS; SCALE; JOINT; SETS;
D O I
10.1016/j.knosys.2022.110042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiview subspace learning (MSL) has been widely used in various practical applications including emotion recognition. Despite the recent progress in MSL, two challenges remain to address. First, most existing MSL methods indiscriminately utilize both helpful and defective information contained in different views. Second, the most recent methods are linear approaches that do not perform well on emotion datasets with weak linear separability. Therefore, in this study, we introduce a framework for emotion recognition: multiview nonlinear discriminant structure learning (MNDSL). MNDSL fully exploits useful information in each input through local information preservation and discriminant reconstruction (LPDR) and obtains latent subspaces using multiview discriminant latent subspace learning (MDLSL). In addition, an out-of-sample extension was introduced to satisfy the requirements of large-scale applications and obtain the projections of new samples. The proposed framework constructs interviews and intra-view-weighted connections to explore discriminant structures and preserve locality under complementarity and correlation principles. The results demonstrate the superiority of the proposed framework compared with state-of-the-art methods. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 95 条
  • [1] Aguilar G, 2019, Arxiv, DOI arXiv:1906.10198
  • [2] Akaho S., 2006, PREPRINT, Patent No. 0609071
  • [3] [Anonymous], 1998, The averaged Karolinska directed emotional faces
  • [4] [Anonymous], 2014, ADV ELECT COMPUTERS
  • [5] [Anonymous], 2014, ACM INT C MULTIMEDIA
  • [6] [Anonymous], 2019, IEEE T NEURAL NETW L
  • [7] Ashkanasy NM, 2008, NEW HORIZ MANAG, P1
  • [8] Laplacian eigenmaps for dimensionality reduction and data representation
    Belkin, M
    Niyogi, P
    [J]. NEURAL COMPUTATION, 2003, 15 (06) : 1373 - 1396
  • [9] Bhatia R., 2013, Matrix analysis, V169, DOI 10.1007/978-1-4612-0653-8
  • [10] Human emotion recognition and analysis in response to audio music using brain signals
    Bhatti, Adnan Mehmood
    Majid, Muhammad
    Anwar, Syed Muhammad
    Khan, Bilal
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2016, 65 : 267 - 275