Improved Emotion Recognition With a Novel Speaker-Independent Feature

Cited by: 48

Authors:

Kim, Eun Ho [1]

Hyun, Kyung Hak [1]

Kim, Soo Hyun [1]

Kwak, Yoon Keun [1]

Affiliations:

[1] Korea Adv Inst Sci & Technol, Dept Mech Engn, Taejon 305701, South Korea

Source:

IEEE-ASME TRANSACTIONS ON MECHATRONICS | 2009 / Vol. 14 / No. 3

Funding:

National Research Foundation of Singapore;

Keywords:

Emotional interaction; intelligent robots; speaker-independent system; speech emotion recognition; SPEECH; SYSTEM; INTERFACE; STRESS; NOISE;

DOI:

10.1109/TMECH.2008.2008644

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Emotion recognition is one of the latest challenges in human-robot interaction. This paper describes the realization of emotional interaction for a Thinking Robot, focusing on speech emotion recognition. In general, speaker-independent systems show a lower accuracy rate than speaker-dependent systems, because emotional feature values depend on the speaker and the speaker's gender. However, speaker-independent systems are required for commercial applications. This paper proposes a novel feature for constructing a speaker-independent system: the ratio of a spectral flatness measure to a spectral center (RSS), which varies little across speakers. Gender and emotion are hierarchically classified by using the proposed feature (RSS), pitch, energy, and the mel frequency cepstral coefficients. An average recognition rate of 57.2% (+/- 5.7%) at a 90% confidence interval is achieved with the proposed system in the speaker-independent mode.
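The abstract names the RSS feature but does not give its formula. The following is a minimal sketch of one plausible reading, assuming the spectral flatness measure is the geometric mean of the power spectrum divided by its arithmetic mean, and the spectral center is the power-weighted mean frequency (spectral centroid). The function names, the naive DFT, and the `eps` guard are illustrative assumptions, not the authors' implementation.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_flatness(power, eps=1e-12):
    """Assumed definition: geometric mean / arithmetic mean of the power spectrum."""
    log_mean = sum(math.log(p + eps) for p in power) / len(power)
    arith_mean = sum(power) / len(power) + eps
    return math.exp(log_mean) / arith_mean


def spectral_center(power, sample_rate, frame_len, eps=1e-12):
    """Assumed definition: power-weighted mean frequency in Hz."""
    freqs = [k * sample_rate / frame_len for k in range(len(power))]
    return sum(f * p for f, p in zip(freqs, power)) / (sum(power) + eps)


def rss(frame, sample_rate, eps=1e-12):
    """Ratio of spectral flatness to spectral center for one frame (hypothetical form)."""
    power = power_spectrum(frame)
    center = spectral_center(power, sample_rate, len(frame))
    return spectral_flatness(power) / (center + eps)


# Usage: a pure tone has a peaked spectrum, an impulse has a flat one.
tone = [math.sin(2 * math.pi * 1000 * t / 8000) for t in range(64)]
impulse = [1.0] + [0.0] * 63
print(rss(tone, 8000), rss(impulse, 8000))
```

Under these assumed definitions, the flatness term is close to 0 for a tonal frame and close to 1 for a noise-like frame, so the ratio separates the two even when their spectral centers are similar. Whether the paper computes the feature per frame or per utterance is not stated in the abstract.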


Pages: 317-325

Number of pages: 9
