Application of Vector Quantization in Emotion Recognition from Human Speech

被引：0

作者：

Khanna, Preeti ^{[1
]}

Kumar, M. Sasi ^{[2
]}

机构：

[1] SVKMs NMIMS, SBM, Bombay, Maharashtra, India

[2] CDAC, Bombay, Maharashtra, India

来源：

INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT | 2011年 / 141卷

关键词：

Emotion recognition; Mel frequency cepstral coefficient; vector quantization; German database;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recognition of emotions from speech is a complex task that is furthermore complicated by the fact that there is no unambiguous answer to what the "correct" emotion is for a given speech sample. In this paper, we discuss emotion classification of a well known German database consisting of 6 basic emotions: sadness, boredom, neutral, fear, happiness, and anger using Mel frequency Cepstral Coefficients (MFCCs). A concern with MFCC is the large number of features. We discuss the use of LBG-VQ algorithm to minimize the amount of data to be handled. At last, emotion classification is done using Euclidean distance, Manhattan distance and Chebyshev distance of the codebooks between neutral state and other emotional states for the same sample.

引用

页码：118 / +

页数：2

共 50 条

[41] Emotion Recognition from Speech: An Unsupervised Learning Approach
Rovetta, Stefano
Mnasri, Zied
Masulli, Francesco
Cabri, Alberto
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 23 - 35
[42] Multiroom Speech Emotion Recognition
Shalev, Erez
Cohen, Israel
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 135 - 139
[43] Adaptive hierarchical emotion recognition from speech signal for human-robot communication
Le, Ba-Vui
Lee, Sungyoung
2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 807 - 810
[44] Persian Speech Emotion Recognition
Savargiv, Mohammad
Bastanfard, Azam
2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
[45] INFLUENCE OF AUDIO BANDWIDTH ON SPEECH EMOTION RECOGNITION BY HUMAN SUBJECTS
Lahaie, Olivier
Lefebvre, Roch
Gournay, Philippe
2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 61 - 65
[46] English speech emotion recognition method based on speech recognition
Liu, Man
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (2) : 391 - 398
[47] Improving Automatic Emotion Recognition from Speech Signals
Bozkurt, Elif
Erzin, Engin
Erdem, Cigdem Eroglu
Erdem, A. Tanju
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +
[48] Emotion Recognition and Spoof Detection from Whispered Speech
Sivan, Dawn
Gopakumar, C.
2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 1091 - 1095
[49] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
Nakano, Shoichi
Yamamoto, Kazumasa
Nakagawa, Seiichi
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
[50] English speech emotion recognition method based on speech recognition
Man Liu
International Journal of Speech Technology, 2022, 25 : 391 - 398

← 1 2 3 4 5 →