Energy Efficient Graph-Based Hybrid Learning for Speech Emotion Recognition on Humanoid Robot

被引:2
|
作者
Wu, Haowen [1 ]
Xu, Hanyue [1 ,2 ]
Seng, Kah Phooi [1 ,3 ,4 ]
Chen, Jieli [1 ,2 ]
Ang, Li Minn [4 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou 215000, Peoples R China
[2] Univ Liverpool, Dept Elect Engn & Elect, Liverpool L69 3GJ, England
[3] Queensland Univ Technol, Sch Comp Sci, Brisbane, Qld 4000, Australia
[4] Univ Sunshine Coast, Sch Sci Technol & Engn, Petrie, Qld 4502, Australia
关键词
energy efficient deep learning; graph convolutional neural network; speech emotion recognition; humanoid robot;
D O I
10.3390/electronics13061151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel deep graph-based learning technique for speech emotion recognition which has been specifically tailored for energy efficient deployment within humanoid robots. Our methodology represents a fusion of scalable graph representations, rooted in the foundational principles of graph signal processing theories. By delving into the utilization of cycle or line graphs as fundamental constituents shaping a robust Graph Convolution Network (GCN)-based architecture, we propose an approach which allows the capture of relationships between speech signals to decode intricate emotional patterns and responses. Our methodology is validated and benchmarked against established databases such as IEMOCAP and MSP-IMPROV. Our model outperforms standard GCNs and prevalent deep graph architectures, demonstrating performance levels that align with state-of-the-art methodologies. Notably, our model achieves this feat while significantly reducing the number of learnable parameters, thereby increasing computational efficiency and bolstering its suitability for resource-constrained environments. This proposed energy-efficient graph-based hybrid learning methodology is applied towards multimodal emotion recognition within humanoid robots. Its capacity to deliver competitive performance while streamlining computational complexity and energy efficiency represents a novel approach in evolving emotion recognition systems, catering to diverse real-world applications where precision in emotion recognition within humanoid robots stands as a pivotal requisite.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Speech emotion recognition based on a hybrid of HMM/ANN
    Mao, Xia
    Zhang, Bing
    Luo, Yi
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATIONS, 2007, : 369 - 372
  • [22] GMSS: Graph-Based Multi-Task Self-Supervised Learning for EEG Emotion Recognition
    Li, Yang
    Chen, Ji
    Li, Fu
    Fu, Boxun
    Wu, Hao
    Ji, Youshuo
    Zhou, Yijin
    Niu, Yi
    Shi, Guangming
    Zheng, Wenming
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2512 - 2525
  • [23] Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling
    Liu, Rui
    Hu, Yifan
    Ren, Yi
    Yin, Xiang
    Li, Haizhou
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18698 - 18706
  • [24] HCL: Hybrid Contrastive Learning for Graph-based Recommendation
    Ma, Xiyao
    Gao, Zheng
    Hu, Qian
    AbdelHady, Mohamed
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [25] Modelling graph-based observation spaces for segment-based speech recognition
    Glass, JR
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 157 - 167
  • [26] GSC: A graph-based skill composition framework for robot learning
    Tian, Qiangxing
    Zhang, Shanshan
    Wang, Donglin
    Liu, Jinxin
    Yang, Shuyu
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 182
  • [27] Speech emotion recognition based on an improved brain emotion learning model
    Liu, Zhen-Tao
    Xie, Qiao
    Wu, Min
    Cao, Wei-Hua
    Mei, Ying
    Mao, Jun-Wei
    NEUROCOMPUTING, 2018, 309 : 145 - 156
  • [28] Speech based Emotion Recognition using Machine Learning
    Deshmukh, Girija
    Gaonkar, Apurva
    Golwalkar, Gauri
    Kulkarni, Sukanya
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817
  • [29] Graph Isomorphism Network for Speech Emotion Recognition
    Liu, Jiawang
    Wang, Haoxiang
    INTERSPEECH 2021, 2021, : 3405 - 3409
  • [30] COMPACT GRAPH ARCHITECTURE FOR SPEECH EMOTION RECOGNITION
    Shirian, Amir
    Guha, Tanaya
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6284 - 6288