K-Means Clustering-Based Kernel Canonical Correlation Analysis for Multimodal Emotion Recognition in Human-Robot Interaction

被引:47
|
作者
Chen, Luefeng [1 ,2 ,3 ]
Wang, Kuanlin [1 ,2 ,3 ]
Li, Min [1 ,2 ,3 ]
Wu, Min [1 ,2 ,3 ]
Pedrycz, Witold [4 ,5 ,6 ]
Hirota, Kaoru [7 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China
[4] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6R 2V4, Canada
[5] King Abgudulaziz Univ, Fac Engn, Dept Elect & Comp Engn, Jeddah 21589, Saudi Arabia
[6] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[7] Tokyo Inst Technol, Yokohama, Kanagawa 2268502, Japan
基金
中国国家自然科学基金;
关键词
Feature fusion; K-means clustering; Kernel canonical correlation analysis (KCCA); multimodal emotion recognition; REGRESSION; FEATURES; PATTERN;
D O I
10.1109/TIE.2022.3150097
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, K-means clustering-based Kernel canonical correlation analysis algorithm is proposed for multimodal emotion recognition in human-robot interaction (HRI). The multimodal features (gray pixels; time and frequency domain) extracted from facial expression and speech are fused based on Kernel canonical correlation analysis. K-means clustering is used to select features from multiple modalities and reduce dimensionality. The proposed approach can improve the heterogenicity among different modalities and make multiple modalities complementary to promote multimodal emotion recognition. Experiments on two datasets, namely SAVEE and eNTER-FACE'05, are conducted to evaluate the accuracy of the proposed method. The results show that the proposed method produces good recognition rates that are higher than the ones produced by the methods without K-means clustering; more specifically, they are 2.77% higher in SAVEE and 4.7% higher in eNTERFACE'05.
引用
收藏
页码:1016 / 1024
页数:9
相关论文
共 50 条
  • [31] Fuzzy K-means clustering with fast density peak clustering on multivariate kernel estimator with evolutionary multimodal optimization clusters on a large dataset
    G. Surya Narayana
    Kamakshaiah Kolli
    Multimedia Tools and Applications, 2021, 80 : 4769 - 4787
  • [32] Fuzzy K-means clustering with fast density peak clustering on multivariate kernel estimator with evolutionary multimodal optimization clusters on a large dataset
    Narayana, G. Surya
    Kolli, Kamakshaiah
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 4769 - 4787
  • [33] Clustering of transformer condition using frequency response analysis based on k-means and GOA
    Bigdeli, Mehdi
    Abu-Siada, Ahmed
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 202
  • [34] Energy Efficient K-means Clustering-based Routing Protocol for WSN Using Optimal Packet Size
    Razzaq, Madiha
    Ningombam, Devarani Devi
    Shin, Seokjoo
    2018 32ND INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2018, : 632 - 635
  • [35] Power Transformer Abnormal State Recognition Model Based on Improved K-means Clustering
    Liang, Xuanhong
    Wang, Youyuan
    Li, Houying
    He, Yigang
    Zhao, Yushun
    2018 IEEE ELECTRICAL INSULATION CONFERENCE (EIC), 2018, : 327 - 330
  • [36] Multimodal shared features learning for emotion recognition by enhanced sparse local discriminative canonical correlation analysis
    Fu, Jiamin
    Mao, Qirong
    Tu, Juanjuan
    Zhan, Yongzhao
    MULTIMEDIA SYSTEMS, 2019, 25 (05) : 451 - 461
  • [37] Multimodal shared features learning for emotion recognition by enhanced sparse local discriminative canonical correlation analysis
    Jiamin Fu
    Qirong Mao
    Juanjuan Tu
    Yongzhao Zhan
    Multimedia Systems, 2019, 25 : 451 - 461
  • [38] A Speech and Facial Information based Emotion Recognition System of Collaborative Robot for Empathic Human-Robot Collaboration
    Loor, Jianna
    Murphy, Jordan
    Li, Rui
    2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, : 2327 - 2332
  • [39] Research on Feature Fusion for Emotion Recognition Based on Discriminative Canonical Correlation Analysis
    ChuqiLiu
    Li, Chao
    ZipingZhao
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2018), 2018, : 30 - 36
  • [40] Two-layer fuzzy multiple random forest for speech emotion recognition in human-robot interaction
    Chen, Luefeng
    Su, Wanjuan
    Feng, Yu
    Wu, Min
    She, Jinhua
    Hirota, Kaoru
    INFORMATION SCIENCES, 2020, 509 : 150 - 163