Floating to Fixed-point Translation with its Application to Speech-based Emotion Recognition

被引:2
|
作者
Kabi, Bibek [1 ]
Sahoo, Subhasmita [2 ]
Samantaray, Amiya Kumar [3 ]
Routray, Aurobinda [2 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect Engn, Kharagpur, W Bengal, India
[3] Natl Inst Technol, Rourkela, India
关键词
Fixed-point arithmetic; hidden Markov model (HMM); mel-frequency cepstral coeffcients (MFCCs); quantization; range estimation; speech-based emotion recognition; wordlength optimization; OPTIMIZATION;
D O I
10.1109/EAIT.2014.57
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Speech-based emotion recognition is one of the latest challenges in speech processing. The algorithms are developed using floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are finally implemented in hand held devices which are required to consume less power, time and have a lower market price. Fixed-point arithmetic with proper determination of integer and fractional bitwidths can help in satisfying these requirements. Therefore we have made an attempt to develop a fixed-point speech-based emotion recognition system using Mel frequency cepstral coefficients (MFCCs) and hidden Markov model (HMM). Accurate range and precision analysis has been carried out to compute optimum integer and fractional wordlengths. The speech emotion engine has been evaluated using Berlin emotional speech database, EMO-DB. A speaker independent emotion recognition accuracy of 71.02% and 67.42% for floating-point and fixed-point formats with optimized wordlenghs respectively was achieved. Finite wordlength effect like quantization with range of relative errors and its effect on emotion recognition task has been analyzed.
引用
收藏
页码:21 / 26
页数:6
相关论文
共 50 条
  • [1] Speech-based Emotion Recognition: Application of Collective Decision Making Concepts
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2014), 2015, : 216 - 220
  • [2] Effect of Reverberation in Speech-based Emotion Recognition
    Zhao, Shujie
    Yang, Yan
    Chen, Jingdong
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [3] An investigation of speech-based human emotion recognition
    Wang, YJ
    Guan, L
    2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 15 - 18
  • [4] Towards Robust Speech-Based Emotion Recognition
    Tabatabaei, Talieh S.
    Krishnan, Sridhar
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [5] Speech-based Emotion Recognition and Next Reaction Prediction
    Noroozi, Fatemeh
    Akrami, Neda
    Anbarjafari, Gholamreza
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [6] Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition
    Siegert, Ingo
    Lotz, Alicia Flores
    Egorow, Olga
    Wolff, Susann
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 625 - 635
  • [7] NEW FIXED-POINT THEOREM AND ITS APPLICATION
    PETRYSHYN, WV
    BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1972, 78 (02) : 225 - +
  • [8] Floating-point to fixed-point conversion of tropical wood recognition system classifier
    Kusuma, Enas Dhuhri
    Yusof, Rubiyah
    Othman, Mohammad Fauzi
    International Journal of Circuits, Systems and Signal Processing, 2014, 8 : 376 - 387
  • [9] Robust Multi-Scenario Speech-Based Emotion Recognition System
    Fangfang Zhu-Zhou
    Gil-Pita, Roberto
    Garcia-Gomez, Joaquin
    Rosa-Zurera, Manuel
    SENSORS, 2022, 22 (06)
  • [10] Multicriteria Neural Network Design in the Speech-based Emotion Recognition Problem
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    Semenkina, Olga
    ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 621 - 628