Two-Dimensional Cepstrum Analysis Approach in Emotion Recognition from Speech

被引:0
|
作者
Guoth, Igor [1 ]
Chmulik, Michal [1 ]
Polacky, Jozef [1 ]
Kuba, Michal [1 ]
机构
[1] Univ Zilina, Fac Elect Engn, Dept Telecommun & Multimedia, AudioLab, Zilina, Slovakia
关键词
two-dimensional cepstrum; support vector regression; emotion speech recognition; bat algorithm; particle swarm optimization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose two-dimensional cepstrum analysis approach applied in the emotion recognition task in continuous space. Experiments has been done on IEMOCAP database which allows emotion recognition in three dimensions: dimension valence, arousal and dominance. Sequence of experiments has been done over whole range of emotions available in the IEMOCAP database. It includes 6 basic emotional states with three additional emotional states. Performance of system based on two-dimensional cepstrum approach is immediate compared with different approaches. For that cause we utilized concept of "bag-of-features". It includes vast collection of features with final amount of 961 features per vector. Then we selected most appropriate features with Bat Algorithm and Particle Swarm Optimization method. Performance of our system has been evaluated with Pearson correlation coefficient. Results from both systems presented in this contribution show that we achieved better results in dimension arousal (DA), dimension dominance (DO) and similar results in last one dimension valence (DV).
引用
收藏
页码:335 / 339
页数:5
相关论文
共 50 条
  • [1] SPEECH ANALYSIS USING TWO-DIMENSIONAL CEPSTRUM.
    Imai, Satoshi
    Kitamura, Tadashi
    Electronics and Communications in Japan (English translation of Denshi Tsushin Gakkai Zasshi), 1976, 59 (12): : 55 - 63
  • [2] Two-dimensional root cepstrum as feature extraction method for speech recognition
    Chilton, E
    Marvi, H
    ELECTRONICS LETTERS, 2003, 39 (10) : 815 - 816
  • [3] GA-based noisy speech recognition using two-dimensional cepstrum
    Lin, CT
    Nein, HW
    Hwu, JY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 664 - 675
  • [4] A DIMENSIONAL APPROACH TO EMOTION RECOGNITION OF SPEECH FROM MOVIES
    Giannakopoulos, Theodoros
    Pikrakis, Aggelos
    Theodoridis, Sergios
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 65 - 68
  • [5] Modified two-dimensional root cepstrum analysis
    Marvi, H
    Chilton, E
    ELECTRONICS LETTERS, 2005, 41 (05) : 285 - 286
  • [7] THE TWO-DIMENSIONAL DIFFERENTIAL CEPSTRUM
    RAGHURAMIREDDY, D
    UNBEHAUEN, R
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (05): : 1335 - 1337
  • [8] Revolutionizing Speech Emotion Recognition: A Novel Hilbert Curve Approach for Two-Dimensional Representation and Convolutional Neural Network Classification
    Tyagi, Suryakant
    Szenasi, Sandor
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2024, 2024, 157 : 75 - 85
  • [9] Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition
    Chan, CP
    Wong, YW
    Lee, T
    Ching, PC
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 405 - 408
  • [10] Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition
    Chan, C.P.
    Wong, Y.W.
    Lee, Tan.
    Ching, P.C.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 405 - 408