Energy Efficient Graph-Based Hybrid Learning for Speech Emotion Recognition on Humanoid Robot

被引:2
|
作者
Wu, Haowen [1 ]
Xu, Hanyue [1 ,2 ]
Seng, Kah Phooi [1 ,3 ,4 ]
Chen, Jieli [1 ,2 ]
Ang, Li Minn [4 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou 215000, Peoples R China
[2] Univ Liverpool, Dept Elect Engn & Elect, Liverpool L69 3GJ, England
[3] Queensland Univ Technol, Sch Comp Sci, Brisbane, Qld 4000, Australia
[4] Univ Sunshine Coast, Sch Sci Technol & Engn, Petrie, Qld 4502, Australia
关键词
energy efficient deep learning; graph convolutional neural network; speech emotion recognition; humanoid robot;
D O I
10.3390/electronics13061151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel deep graph-based learning technique for speech emotion recognition which has been specifically tailored for energy efficient deployment within humanoid robots. Our methodology represents a fusion of scalable graph representations, rooted in the foundational principles of graph signal processing theories. By delving into the utilization of cycle or line graphs as fundamental constituents shaping a robust Graph Convolution Network (GCN)-based architecture, we propose an approach which allows the capture of relationships between speech signals to decode intricate emotional patterns and responses. Our methodology is validated and benchmarked against established databases such as IEMOCAP and MSP-IMPROV. Our model outperforms standard GCNs and prevalent deep graph architectures, demonstrating performance levels that align with state-of-the-art methodologies. Notably, our model achieves this feat while significantly reducing the number of learnable parameters, thereby increasing computational efficiency and bolstering its suitability for resource-constrained environments. This proposed energy-efficient graph-based hybrid learning methodology is applied towards multimodal emotion recognition within humanoid robots. Its capacity to deliver competitive performance while streamlining computational complexity and energy efficiency represents a novel approach in evolving emotion recognition systems, catering to diverse real-world applications where precision in emotion recognition within humanoid robots stands as a pivotal requisite.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Web-based and Interactive Learning - Recognition Method for a Humanoid Robot
    Hidalgo-Pena, Enrique
    Marin-Urias, Luis F.
    Montes-Gonzalez, Fernando
    Marin-Hernandez, Antonio
    Rios-Figueroa, Homero
    3RD IBEROAMERICAN CONFERENCE ON ELECTRONICS ENGINEERING AND COMPUTER SCIENCE, CIIECC 2013, 2013, 7 : 370 - 376
  • [42] Efficient learning of supervised kernels with a graph-based loss function
    Pan, Binbin
    Chen, Wen-Sheng
    Chen, Bo
    Xu, Chen
    INFORMATION SCIENCES, 2016, 370 : 50 - 62
  • [43] GHOSM: Graph-based Hybrid Outline and Skeleton Modelling for Shape Recognition
    Alwaely, Basheer
    Abhayaratne, Charith
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [44] SEMI-SUPERVISED SPEECH RECOGNITION VIA GRAPH-BASED TEMPORAL CLASSIFICATION
    Moritz, Niko
    Hori, Takaaki
    Le Roux, Jonathan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6548 - 6552
  • [45] Efficient Join Order Selection Learning with Graph-based Representation
    Chen, Jin
    Ye, Guanyu
    Zhao, Yan
    Liu, Shuncheng
    Deng, Liwei
    Chen, Xu
    Zhou, Rui
    Zheng, Kai
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 97 - 107
  • [46] Efficient locality weighted sparse representation for graph-based learning
    Feng, Xiaodong
    Wu, Sen
    Zhou, Wenjun
    Quan, Min
    KNOWLEDGE-BASED SYSTEMS, 2017, 121 : 129 - 141
  • [47] Stream-based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning
    Moreno-Acevedo, Santiago A.
    Vasquez-Correa, Juan Camilo
    Martin-Donas, Juan M.
    Alvarez, Aitor
    TEXT, SPEECH, AND DIALOGUE, TSD 2024, PT II, 2024, 15049 : 105 - 117
  • [48] Exploration of Energy-Efficient Architecture for Graph-Based Point-Cloud Deep Learning
    Zhang, Jie-Fang
    Zhang, Zhengya
    2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021), 2021, : 260 - 264
  • [49] Representation Learning for Speech Emotion Recognition
    Ghosh, Sayan
    Laksana, Eugene
    Morency, Louis-Philippe
    Scherer, Stefan
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3603 - 3607
  • [50] Speech Emotion Recognition with Deep Learning
    Harar, Pavol
    Burget, Radim
    Dutta, Malay Kishore
    2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140