K-Means Clustering-Based Kernel Canonical Correlation Analysis for Multimodal Emotion Recognition in Human-Robot Interaction

Cited by: 47
Authors
Chen, Luefeng [1 ,2 ,3 ]
Wang, Kuanlin [1 ,2 ,3 ]
Li, Min [1 ,2 ,3 ]
Wu, Min [1 ,2 ,3 ]
Pedrycz, Witold [4 ,5 ,6 ]
Hirota, Kaoru [7 ]
Affiliations
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China
[4] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6R 2V4, Canada
[5] King Abdulaziz Univ, Fac Engn, Dept Elect & Comp Engn, Jeddah 21589, Saudi Arabia
[6] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[7] Tokyo Inst Technol, Yokohama, Kanagawa 2268502, Japan
Funding
National Natural Science Foundation of China;
Keywords
Feature fusion; K-means clustering; Kernel canonical correlation analysis (KCCA); multimodal emotion recognition; REGRESSION; FEATURES; PATTERN;
DOI
10.1109/TIE.2022.3150097
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
This article proposes a K-means clustering-based kernel canonical correlation analysis (KCCA) algorithm for multimodal emotion recognition in human-robot interaction (HRI). The multimodal features (gray pixels from facial expressions; time- and frequency-domain features from speech) are fused using KCCA. K-means clustering is used to select features from the multiple modalities and to reduce dimensionality. The proposed approach can improve the heterogenicity among different modalities and make the modalities complementary, which promotes multimodal emotion recognition. Experiments on two datasets, SAVEE and eNTERFACE'05, are conducted to evaluate the accuracy of the proposed method. The results show that the proposed method achieves higher recognition rates than the methods without K-means clustering: 2.77% higher on SAVEE and 4.7% higher on eNTERFACE'05.
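The abstract describes the pipeline only at a high level, so the following is a minimal sketch of how K-means-based feature selection followed by KCCA fusion could look, not the authors' implementation. Everything specific here is an assumption: the selection rule (cluster the feature columns and keep the column nearest each centroid), the RBF kernel, one common regularised KCCA eigenproblem, the concatenation of the two projections as the fused feature, and all function names, parameters, and the synthetic stand-in data.

```python
import numpy as np

def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(n_iter):
        dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dist.argmin(axis=1), centers

def select_features(X, k, seed=0):
    """Assumed rule: cluster the columns of X, keep the column closest
    to each centroid. Returns at most k sorted column indices."""
    Z = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-12)
    cols = Z.T  # one row per feature
    labels, centers = kmeans(cols, k, seed=seed)
    keep = []
    for j in range(k):
        idx = np.flatnonzero(labels == j)
        if idx.size:
            d = ((cols[idx] - centers[j]) ** 2).sum(axis=1)
            keep.append(idx[d.argmin()])
    return np.sort(np.array(keep))

def rbf_kernel(X, gamma=None):
    """Gaussian kernel matrix; gamma defaults to 1 / n_features."""
    gamma = 1.0 / X.shape[1] if gamma is None else gamma
    sq = (X ** 2).sum(axis=1)
    d = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(d, 0.0))

def center(K):
    """Centre a kernel matrix in feature space."""
    n = len(K)
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ K @ H

def kcca(Kx, Ky, n_comp, reg=0.1):
    """Regularised KCCA: solve
    (Kx + reg I)^-1 Ky (Ky + reg I)^-1 Kx alpha = rho^2 alpha."""
    I = np.eye(len(Kx))
    A = np.linalg.solve(Kx + reg * I, Ky)
    B = np.linalg.solve(Ky + reg * I, Kx)
    vals, vecs = np.linalg.eig(A @ B)
    order = np.argsort(-vals.real)[:n_comp]
    alpha = vecs[:, order].real
    beta = B @ alpha  # matching dual weights, up to scale
    return alpha, beta

# Synthetic stand-ins for face and speech features (not data from the paper).
rng = np.random.default_rng(0)
n = 60
latent = rng.normal(size=(n, 1))
face = latent @ rng.normal(size=(1, 20)) + 0.5 * rng.normal(size=(n, 20))
speech = np.tanh(latent) @ rng.normal(size=(1, 12)) + 0.5 * rng.normal(size=(n, 12))

face_idx = select_features(face, k=6)
speech_idx = select_features(speech, k=4)
Kx = center(rbf_kernel(face[:, face_idx]))
Ky = center(rbf_kernel(speech[:, speech_idx]))
alpha, beta = kcca(Kx, Ky, n_comp=3)
fused = np.hstack([Kx @ alpha, Ky @ beta])  # assumed fusion: concatenation
```

In this sketch `fused` has one row per sample and `2 * n_comp` columns, which could then be passed to any classifier. The regularisation `reg` matters in practice: unregularised KCCA tends to find perfectly correlated but meaningless projections on the training set.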
Pages: 1016 - 1024
Page count: 9
Related papers
50 in total
  • [21] Multimodal Emotion Recognition Using Deep Generalized Canonical Correlation Analysis with an Attention Mechanism
    Lan, Yu-Ting
    Liu, Wei
    Lu, Bao-Liang
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [22] Air Targets Recognition Based on Feature Analysis and Particle Swarm Optimization K-Means Clustering
    Wang, Shaolei
    Chen, Weiyi
    MECHATRONICS AND INTELLIGENT MATERIALS II, PTS 1-6, 2012, 490-495 : 1718 - 1722
  • [23] Extended study of k-Means Clustering Technique for Human Face Classification and recognition
    Dey, Tumpa
    Deb, Tamojay
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [24] K-MEANS CLUSTERING TO TTR BASED LEXICAL DIVERSITY ANALYSIS
    Zhang, Yanhui
    ADVANCES AND APPLICATIONS IN STATISTICS, 2020, 64 (02) : 267 - 276
  • [25] Coupled Multimodal Emotional Feature Analysis Based on Broad-Deep Fusion Networks in Human-Robot Interaction
    Chen, Luefeng
    Li, Min
    Wu, Min
    Pedrycz, Witold
    Hirota, Kaoru
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9663 - 9673
  • [26] Kernel-Reliability-Based K-Means (KRKM) Clustering Algorithm and Image Processing
    Hua, Chunsheng
    Qi, Juntong
    Han, Jianda
    Wu, Haiyuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09) : 2423 - 2433
  • [27] Class Discovery Based on K-means Clustering and Perturbation Analysis
    Ru, Xiaohu
    Liu, Zheng
    Huang, Zhitao
    Jiang, Wenli
    2015 8TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2015, : 1236 - 1240
  • [28] A K-means Clustering Based Algorithm for Shill Bidding Recognition in Online Auction
    Lei, Bin
    Zhang, Huichao
    Chen, Huiyu
    Liu, Lili
    Wang, Dingwei
    PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 939 - 943
  • [29] FACIAL EXPRESSION RECOGNITION BASED ON IMPROVED LBP OPERATOR AND K-MEANS CLUSTERING
    Wang Yunfei
    Ding Hui
    Liu Yi
    Pan Yanyan
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT INNOVATION, 2015, 28 : 829 - 833
  • [30] Convolutional Features-Based Broad Learning With LSTM for Multidimensional Facial Emotion Recognition in Human-Robot Interaction
    Chen, Luefeng
    Li, Min
    Wu, Min
    Pedrycz, Witold
    Hirota, Kaoru
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (01) : 64 - 75