Application of Vector Quantization in Emotion Recognition from Human Speech

被引:0
|
作者
Khanna, Preeti [1 ]
Kumar, M. Sasi [2 ]
机构
[1] SVKMs NMIMS, SBM, Bombay, Maharashtra, India
[2] CDAC, Bombay, Maharashtra, India
来源
INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT | 2011年 / 141卷
关键词
Emotion recognition; Mel frequency cepstral coefficient; vector quantization; German database;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognition of emotions from speech is a complex task that is furthermore complicated by the fact that there is no unambiguous answer to what the "correct" emotion is for a given speech sample. In this paper, we discuss emotion classification of a well known German database consisting of 6 basic emotions: sadness, boredom, neutral, fear, happiness, and anger using Mel frequency Cepstral Coefficients (MFCCs). A concern with MFCC is the large number of features. We discuss the use of LBG-VQ algorithm to minimize the amount of data to be handled. At last, emotion classification is done using Euclidean distance, Manhattan distance and Chebyshev distance of the codebooks between neutral state and other emotional states for the same sample.
引用
收藏
页码:118 / +
页数:2
相关论文
共 50 条
  • [31] Speech Based Human Emotion Recognition Using MFCC
    Likitha, M. S.
    Gupta, Raksha R.
    Hasitha, K.
    Raju, A. Upendra
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 2257 - 2260
  • [32] DHMM Speech Recognition Algorithm Based on Immune Particle Swarm Vector Quantization
    Ning, Aiping
    Zhang, Xueying
    Duan, Wei
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 420 - 427
  • [33] Speech Recognition Method Based on Genetic Vector Quantization and BP Neural Network
    Gao Li'ai
    Li Lihua
    Zhou Jian
    Zhao Qiuxia
    PIAGENG 2009: IMAGE PROCESSING AND PHOTONICS FOR AGRICULTURAL ENGINEERING, 2009, 7489
  • [34] Speech Emotion Recognition Based on Fuzzy Least Squares Support Vector Machines
    Zhang, Shiqing
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 1299 - 1302
  • [35] Evaluating intonational features for emotion recognition from speech
    Zervas, Panagiotis
    Mporas, Iosif
    Fakotakis, Nikos
    Kokkinakis, George
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (06) : 1001 - 1014
  • [36] SUPERVISED DOMAIN ADAPTATION FOR EMOTION RECOGNITION FROM SPEECH
    Abdelwahab, Mohammed
    Busso, Carlos
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5058 - 5062
  • [37] Autoencoder With Emotion Embedding for Speech Emotion Recognition
    Zhang, Chenghao
    Xue, Lei
    IEEE ACCESS, 2021, 9 : 51231 - 51241
  • [38] Learning Alignment for Multimodal Emotion Recognition from Speech
    Xu, Haiyang
    Zhang, Hui
    Han, Kun
    Wang, Yun
    Peng, Yiping
    Li, Xiangang
    INTERSPEECH 2019, 2019, : 3569 - 3573
  • [39] Emotion recognition from the facial image and speech signal
    Go, HJ
    Kwak, KC
    Lee, DJ
    Chun, MG
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 2890 - 2895
  • [40] A DIMENSIONAL APPROACH TO EMOTION RECOGNITION OF SPEECH FROM MOVIES
    Giannakopoulos, Theodoros
    Pikrakis, Aggelos
    Theodoridis, Sergios
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 65 - 68