Memristor-Based Progressive Hierarchical Conformer Architecture for Speech Emotion Recognition

Cited by: 0
Authors
Zhao, Tianhao [1]
Zhou, Yue [1,2]
Hu, Xiaofang [1,2]
Affiliations
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Southwest Univ, Chongqing Key Lab Brain inspired Comp & Intelligen, Chongqing 400715, Peoples R China
Source
INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS | 2024 / Vol. 34 / No. 09
Funding
National Natural Science Foundation of China;
Keywords
Memristor; self-attention mechanism; speech emotion recognition; conformer; circuit; CIRCUIT IMPLEMENTATION; FEATURES; SYSTEM;
DOI
10.1142/S0218127424501177
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
Speech Emotion Recognition (SER) is a challenging task characterized by the diversity and complexity of emotional expression. Owing to its powerful feature extraction capabilities, the Transformer Network (TN) shows advantages and potential in SER. However, the limited size of available datasets and the difficulty of decoupling emotional features restrain its performance and make it challenging to implement SER on edge devices. To address these issues, we present a Memristor-based Progressive Hierarchical Conformer Architecture (MPCA) and design a conformer submodule that leverages convolution to mitigate TN's limitations in SER. We propose attention-based feature decoupling, which employs hierarchical extraction to decouple speaker characteristics and retain the relevant components, thereby obtaining reliable emotional features. Furthermore, we propose a reconfigurable circuit implementation scheme for MPCA based on operator multiplexing, yielding flexible modules that can be dynamically adjusted to the resources of edge devices; the stability of the designed circuit is analyzed through simulation experiments with PSPICE. We show that the proposed MPCA achieves state-of-the-art performance in SER while significantly reducing system power consumption, offering a solution for SER implementation on edge devices.
Pages: 14