Application of Vector Quantization in Emotion Recognition from Human Speech

被引：0

作者：

Khanna, Preeti ^{[1
]}

Kumar, M. Sasi ^{[2
]}

机构：

[1] SVKMs NMIMS, SBM, Bombay, Maharashtra, India

[2] CDAC, Bombay, Maharashtra, India

来源：

INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT | 2011年 / 141卷

关键词：

Emotion recognition; Mel frequency cepstral coefficient; vector quantization; German database;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recognition of emotions from speech is a complex task that is furthermore complicated by the fact that there is no unambiguous answer to what the "correct" emotion is for a given speech sample. In this paper, we discuss emotion classification of a well known German database consisting of 6 basic emotions: sadness, boredom, neutral, fear, happiness, and anger using Mel frequency Cepstral Coefficients (MFCCs). A concern with MFCC is the large number of features. We discuss the use of LBG-VQ algorithm to minimize the amount of data to be handled. At last, emotion classification is done using Euclidean distance, Manhattan distance and Chebyshev distance of the codebooks between neutral state and other emotional states for the same sample.

引用

页码：118 / +

页数：2

共 50 条

[31] Speech Based Human Emotion Recognition Using MFCC
Likitha, M. S.
Gupta, Raksha R.
Hasitha, K.
Raju, A. Upendra
2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 2257 - 2260
[32] DHMM Speech Recognition Algorithm Based on Immune Particle Swarm Vector Quantization
Ning, Aiping
Zhang, Xueying
Duan, Wei
ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 420 - 427
[33] Speech Recognition Method Based on Genetic Vector Quantization and BP Neural Network
Gao Li'ai
Li Lihua
Zhou Jian
Zhao Qiuxia
PIAGENG 2009: IMAGE PROCESSING AND PHOTONICS FOR AGRICULTURAL ENGINEERING, 2009, 7489
[34] Speech Emotion Recognition Based on Fuzzy Least Squares Support Vector Machines
Zhang, Shiqing
2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 1299 - 1302
[35] Evaluating intonational features for emotion recognition from speech
Zervas, Panagiotis
Mporas, Iosif
Fakotakis, Nikos
Kokkinakis, George
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (06) : 1001 - 1014
[36] SUPERVISED DOMAIN ADAPTATION FOR EMOTION RECOGNITION FROM SPEECH
Abdelwahab, Mohammed
Busso, Carlos
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5058 - 5062
[37] Autoencoder With Emotion Embedding for Speech Emotion Recognition
Zhang, Chenghao
Xue, Lei
IEEE ACCESS, 2021, 9 : 51231 - 51241
[38] Learning Alignment for Multimodal Emotion Recognition from Speech
Xu, Haiyang
Zhang, Hui
Han, Kun
Wang, Yun
Peng, Yiping
Li, Xiangang
INTERSPEECH 2019, 2019, : 3569 - 3573
[39] Emotion recognition from the facial image and speech signal
Go, HJ
Kwak, KC
Lee, DJ
Chun, MG
SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 2890 - 2895
[40] A DIMENSIONAL APPROACH TO EMOTION RECOGNITION OF SPEECH FROM MOVIES
Giannakopoulos, Theodoros
Pikrakis, Aggelos
Theodoridis, Sergios
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 65 - 68

← 1 2 3 4 5 →