Application of Vector Quantization in Emotion Recognition from Human Speech

被引:0
|
作者
Khanna, Preeti [1 ]
Kumar, M. Sasi [2 ]
机构
[1] SVKMs NMIMS, SBM, Bombay, Maharashtra, India
[2] CDAC, Bombay, Maharashtra, India
来源
INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT | 2011年 / 141卷
关键词
Emotion recognition; Mel frequency cepstral coefficient; vector quantization; German database;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognition of emotions from speech is a complex task that is furthermore complicated by the fact that there is no unambiguous answer to what the "correct" emotion is for a given speech sample. In this paper, we discuss emotion classification of a well known German database consisting of 6 basic emotions: sadness, boredom, neutral, fear, happiness, and anger using Mel frequency Cepstral Coefficients (MFCCs). A concern with MFCC is the large number of features. We discuss the use of LBG-VQ algorithm to minimize the amount of data to be handled. At last, emotion classification is done using Euclidean distance, Manhattan distance and Chebyshev distance of the codebooks between neutral state and other emotional states for the same sample.
引用
收藏
页码:118 / +
页数:2
相关论文
共 50 条
  • [41] Emotion Recognition from Speech: An Unsupervised Learning Approach
    Rovetta, Stefano
    Mnasri, Zied
    Masulli, Francesco
    Cabri, Alberto
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 23 - 35
  • [42] Multiroom Speech Emotion Recognition
    Shalev, Erez
    Cohen, Israel
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 135 - 139
  • [43] Adaptive hierarchical emotion recognition from speech signal for human-robot communication
    Le, Ba-Vui
    Lee, Sungyoung
    2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 807 - 810
  • [44] Persian Speech Emotion Recognition
    Savargiv, Mohammad
    Bastanfard, Azam
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [45] INFLUENCE OF AUDIO BANDWIDTH ON SPEECH EMOTION RECOGNITION BY HUMAN SUBJECTS
    Lahaie, Olivier
    Lefebvre, Roch
    Gournay, Philippe
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 61 - 65
  • [46] English speech emotion recognition method based on speech recognition
    Liu, Man
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (2) : 391 - 398
  • [47] Improving Automatic Emotion Recognition from Speech Signals
    Bozkurt, Elif
    Erzin, Engin
    Erdem, Cigdem Eroglu
    Erdem, A. Tanju
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +
  • [48] Emotion Recognition and Spoof Detection from Whispered Speech
    Sivan, Dawn
    Gopakumar, C.
    2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 1091 - 1095
  • [49] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
    Nakano, Shoichi
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
  • [50] English speech emotion recognition method based on speech recognition
    Man Liu
    International Journal of Speech Technology, 2022, 25 : 391 - 398