A comparison of neural networks for real-time emotion recognition from speech signals

Cited by: 0
Authors
Affiliation
[1] Department of Software Engineering, Izmir University of Economics, Sakarya Cad No.156, Balcova, Izmir 35330, Turkey
Source
WSEAS Trans. Signal Process. | 2009 / Vol. 3 / 116-125
Keywords
Neural networks - Human computer interaction - Speech recognition - Emotion Recognition - Application programs
DOI
Not available
Abstract
Speech and emotion recognition improve the quality of human-computer interaction and enable easier-to-use interfaces for users at every level in software applications. In this study, we developed two neural networks, the emotion recognition neural network (ERNN) and the Gram-Charlier emotion recognition neural network (GERNN), to classify voice signals for emotion recognition. The ERNN has 128 input nodes, 20 hidden neurons, and three summing output nodes. A set of 97920 training sets is used to train the ERNN, and a new set of 24480 testing sets is used to test its performance. The samples tested for voice recognition are taken from the movies Anger Management and Pick of Destiny. The ERNN achieves an average recognition performance of 100%. This high level of recognition suggests that the ERNN is a promising method for emotion recognition in computer applications. The GERNN has four input nodes, 20 hidden neurons, and three output nodes, and achieves an average recognition performance of 33%. This indicates that Gram-Charlier coefficients cannot be used to discriminate emotion signals. In addition, Hinton diagrams were used to display the optimality of the ERNN weights.
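The two layer layouts described in the abstract (128-20-3 for the ERNN, 4-20-3 for the GERNN) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the activation function, weight initialisation, or training procedure, so the tanh hidden units, the uniform initialisation, and the reading of "summing output nodes" as linear sums are assumptions, and every function name below is hypothetical.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer.

    Assumption: small uniform random weights and zero biases; the
    abstract does not describe the initialisation.
    """
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, weights, biases):
    """Weighted sum of the inputs plus bias, one value per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def build_network(n_inputs, n_hidden=20, n_outputs=3, seed=0):
    """Build an untrained input-hidden-output network as two dense layers."""
    rng = random.Random(seed)
    return [init_layer(n_inputs, n_hidden, rng),
            init_layer(n_hidden, n_outputs, rng)]


def forward(x, layers):
    """Forward pass: tanh hidden layer (assumption), linear summing outputs."""
    (w1, b1), (w2, b2) = layers
    hidden = [math.tanh(v) for v in dense(x, w1, b1)]
    return dense(hidden, w2, b2)


def predict(x, layers):
    """Index of the output node with the largest score."""
    scores = forward(x, layers)
    return max(range(len(scores)), key=scores.__getitem__)


def count_parameters(layers):
    """Total number of weights and biases in the network."""
    return sum(len(w) * len(w[0]) + len(b) for w, b in layers)


ernn = build_network(128)   # 128 inputs, 20 hidden, 3 outputs
gernn = build_network(4)    # 4 inputs, 20 hidden, 3 outputs

if __name__ == "__main__":
    print("ERNN parameters:", count_parameters(ernn))
    print("GERNN parameters:", count_parameters(gernn))
    print("ERNN class for an all-zero input:", predict([0.0] * 128, ernn))
```

With these layouts the ERNN has 128 × 20 + 20 + 20 × 3 + 3 = 2643 trainable parameters and the GERNN has 163. The stated set sizes, 97920 for training and 24480 for testing, amount to an 80/20 split of 122400 sets. If the three output nodes correspond to three equally represented emotion classes (an assumption, since the abstract does not name the classes), random guessing would score about 33%, which is the level the GERNN reaches.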
Related papers
50 in total
  • [21] Real-time Emotion Recognition for Sales
    Naas, Si-Ahmed
    Sigg, Stephan
    2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, : 584 - 591
  • [22] Emotion Recognition from Speech using Spectrograms and Shallow Neural Networks
    Slimi, Anwer
    Hamroun, Mohamed
    Zrigui, Mounir
    Nicolas, Henri
    MOMM 2020: THE 18TH INTERNATIONAL CONFERENCE ON ADVANCES IN MOBILE COMPUTING & MULTIMEDIA, 2020, : 35 - 39
  • [23] Emotion Recognition from EEG Signals Using Recurrent Neural Networks
    Chowdary, M. Kalpana
    Anitha, J.
    Hemanth, D. Jude
    ELECTRONICS, 2022, 11 (15)
  • [24] Real-time speech emotion recognition using deep learning and data augmentation
    Barhoumi, Chawki
    Benayed, Yassine
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (02)
  • [25] Investigation of Fixed-dimensional Speech Representations for Real-time Speech Emotion Recognition System
    Rao, Wei
    Lim, Zhi Hao
    Wang, Qing
    Xu, Chenglin
    Tian, Xiaohai
    Chng, Eng Siong
    Li, Haizhou
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017, : 197 - 200
  • [26] Towards real-time speech emotion recognition for affective e-learning
    Bahreini K.
    Nadolski R.
    Westera W.
    Education and Information Technologies, 2016, 21 (5) : 1367 - 1386
  • [27] Design and Implementation of a Real-time Emotion Recognition System Based on Physiological signals
    Liu X.
    Zhong M.-L.
    Lin Y.-F.
    Liu Z.-W.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2019, 39 : 176 - 180
  • [28] Deep Spiking Neural Network model for time-variant signals classification: a real-time speech recognition approach
    Dominguez-Morales, Juan P.
    Liu, Qian
    James, Robert
    Gutierrez-Galan, Daniel
    Jimenez-Fernandez, Angel
    Davidson, Simon
    Furber, Steve
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018
  • [29] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59
  • [30] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46