Energy Efficient Graph-Based Hybrid Learning for Speech Emotion Recognition on Humanoid Robot

被引:2
|
作者
Wu, Haowen [1 ]
Xu, Hanyue [1 ,2 ]
Seng, Kah Phooi [1 ,3 ,4 ]
Chen, Jieli [1 ,2 ]
Ang, Li Minn [4 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou 215000, Peoples R China
[2] Univ Liverpool, Dept Elect Engn & Elect, Liverpool L69 3GJ, England
[3] Queensland Univ Technol, Sch Comp Sci, Brisbane, Qld 4000, Australia
[4] Univ Sunshine Coast, Sch Sci Technol & Engn, Petrie, Qld 4502, Australia
关键词
energy efficient deep learning; graph convolutional neural network; speech emotion recognition; humanoid robot;
D O I
10.3390/electronics13061151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel deep graph-based learning technique for speech emotion recognition which has been specifically tailored for energy efficient deployment within humanoid robots. Our methodology represents a fusion of scalable graph representations, rooted in the foundational principles of graph signal processing theories. By delving into the utilization of cycle or line graphs as fundamental constituents shaping a robust Graph Convolution Network (GCN)-based architecture, we propose an approach which allows the capture of relationships between speech signals to decode intricate emotional patterns and responses. Our methodology is validated and benchmarked against established databases such as IEMOCAP and MSP-IMPROV. Our model outperforms standard GCNs and prevalent deep graph architectures, demonstrating performance levels that align with state-of-the-art methodologies. Notably, our model achieves this feat while significantly reducing the number of learnable parameters, thereby increasing computational efficiency and bolstering its suitability for resource-constrained environments. This proposed energy-efficient graph-based hybrid learning methodology is applied towards multimodal emotion recognition within humanoid robots. Its capacity to deliver competitive performance while streamlining computational complexity and energy efficiency represents a novel approach in evolving emotion recognition systems, catering to diverse real-world applications where precision in emotion recognition within humanoid robots stands as a pivotal requisite.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Graph-based Kinship Recognition
    Guo, Yuanhao
    Dibeklioglu, Hamdi
    van der Maaten, Laurens
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4287 - 4292
  • [32] Adaptive facial point detection and emotion recognition for a humanoid robot
    Zhang, Li
    Mistry, Kamlesh
    Jiang, Ming
    Neoh, Siew Chin
    Hossain, Mohammed Alamgir
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 140 : 93 - 114
  • [33] Graph weeds net: A graph-based deep learning method for weed recognition
    Hu, Kun
    Coleman, Guy
    Zeng, Shan
    Wang, Zhiyong
    Walsh, Michael
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 174
  • [34] Deep Learning Algorithms for Speech Emotion Recognition with Hybrid Spectral Features
    Kogila R.
    Sadanandam M.
    Bhukya H.
    SN Computer Science, 5 (1)
  • [35] Speech emotion recognition based on Graph-LSTM neural network
    Li, Yan
    Wang, Yapeng
    Yang, Xu
    Im, Sio-Kei
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [36] Speech emotion recognition based on Graph-LSTM neural network
    Yan Li
    Yapeng Wang
    Xu Yang
    Sio-Kei Im
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [37] Efficient Emotion Recognition from Speech Using Deep Learning on Spectrograms
    Satt, Aharon
    Rozenberg, Shai
    Hoory, Ron
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1089 - 1093
  • [38] Intelligent facial emotion recognition and semantic-based topic detection for a humanoid robot
    Zhang, Li
    Jiang, Ming
    Farid, Dewan
    Hossain, M. A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (13) : 5160 - 5168
  • [39] Efficient bimodal emotion recognition system based on speech/text embeddings and ensemble learning fusion
    Chakhtouna, Adil
    Sekkate, Sara
    Adib, Abdellah
    ANNALS OF TELECOMMUNICATIONS, 2025,
  • [40] Graph-Based Visual Semantic Perception for Humanoid Robots
    Grotz, Markus
    Kaiser, Peter
    Aksoy, Eren Erdal
    Paus, Fabian
    Asfour, Minim
    2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 869 - 875