A comparison of neural networks for real-time emotion recognition from speech signals

被引:0
|
作者
Department of Software Engineering, Izmir University of Economics, Sakarya Cad No.156, Balcova, Izmir 35330, Turkey [1 ]
机构
来源
WSEAS Trans. Signal Process. | 2009年 / 3卷 / 116-125期
关键词
Neural networks - Human computer interaction - Speech recognition - Emotion Recognition - Application programs;
D O I
暂无
中图分类号
学科分类号
摘要
Speech and emotion recognition improve the quality of human computer interaction and allow easier to use interfaces for every level of user in software applications. In this study, we have developed two different neural networks called emotion recognition neural network (ERNN) and Gram-Charlier emotion recognition neural network (GERNN) to classify the voice signals for emotion recognition. The ERNN has 128 input nodes, 20 hidden neurons, and three summing output nodes. A set of 97920 training sets is used to train the ERNN. A new set of 24480 testing sets is utilized to test the ERNN performance. The samples tested for voice recognition are acquired from the movies Anger Management and Pick of Destiny . ERNN achieves an average recognition performance of 100%. This high level of recognition suggests that the ERNN is a promising method for emotion recognition in computer applications. Furthermore, the GERNN has four input nodes, 20 hidden neurons, and three output nodes. The GERNN achieves an average recognition performance of 33%. This shows us that we cannot use Gram-Charlier coefficients to discriminate emotion signals. In addition, Hinton diagrams were utilized to display the optimality of ERNN weights.
引用
收藏
相关论文
共 50 条
  • [1] Towards Real-time Speech Emotion Recognition using Deep Neural Networks
    Fayek, H. M.
    Lech, M.
    Cavedon, L.
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
  • [2] Real-time emotion recognition from speech using echo state networks
    Scherer, Stefan
    Oubbati, Mohamed
    Schwenker, Friedhelm
    Palm, Guenther
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2008, 5064 : 205 - 216
  • [3] Research on Real-Time Speech Emotion Recognition Framework
    Tang, Guichen
    Liang, Ruiyu
    Wang, Jie
    Zhu, Yue
    2018 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS 2018), 2018, : 446 - 448
  • [4] Real-Time Speech Emotion Recognition by Minimum Number of Features
    Savargiv, Mohammad
    Bastanfard, Azam
    2016 ARTIFICIAL INTELLIGENCE AND ROBOTICS (IRANOPEN), 2016, : 72 - 76
  • [5] Real-time emotion recognition system with multiple physiological signals
    Zhuang, Jyun-Rong
    Guan, Ya-Jing
    Nagayoshi, Hayato
    Muramatsu, Keiichi
    Watanuki, Keiichi
    Tanaka, Eiichiro
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2019, 13 (04)
  • [6] An Evaluation of Emotion Units and Feature Types for Real-Time Speech Emotion Recognition
    Vogt, Thurid
    Andre, Elisabeth
    KUNSTLICHE INTELLIGENZ, 2011, 25 (03): : 213 - 223
  • [7] A comparison between recurrent neural architectures for real-time nonlinear prediction of speech signals
    Pérez-Ortiz, JA
    Calera-Rubio, J
    Forcada, ML
    NEURAL NETWORKS FOR SIGNAL PROCESSING XI, 2001, : 73 - 81
  • [8] DEEP NEURAL NETWORKS WITH AUXILIARY GAUSSIAN MIXTURE MODELS FOR REAL-TIME SPEECH RECOGNITION
    Lei, Xin
    Lin, Hui
    Heigold, Georg
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7634 - 7638
  • [9] Real-Time Hand Gesture Recognition Based on Electromyographic Signals and Artificial Neural Networks
    Motoche, Cristhian
    Benalcazar, Marco E.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 352 - 361
  • [10] Real-Time Movie-Induced Discrete Emotion Recognition from EEG Signals
    Liu, Yong-Jin
    Yu, Minjing
    Zhao, Guozhen
    Song, Jinjing
    Ge, Yan
    Shi, Yuanchun
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) : 550 - 562