Improved Emotion Recognition With a Novel Speaker-Independent Feature

Cited by: 48
Authors
Kim, Eun Ho [1]
Hyun, Kyung Hak [1]
Kim, Soo Hyun [1]
Kwak, Yoon Keun [1]
Affiliations
[1] Korea Advanced Institute of Science and Technology, Department of Mechanical Engineering, Taejon 305701, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Emotional interaction; intelligent robots; speaker-independent system; speech emotion recognition; SPEECH; SYSTEM; INTERFACE; STRESS; NOISE;
DOI
10.1109/TMECH.2008.2008644
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Emotion recognition is one of the latest challenges in human-robot interaction. This paper describes the realization of emotional interaction for a Thinking Robot, focusing on speech emotion recognition. In general, speaker-independent systems show lower accuracy than speaker-dependent systems, because emotional feature values depend on the speaker and the speaker's gender. However, speaker-independent systems are required for commercial applications. This paper proposes a novel feature for constructing a speaker-independent system: the ratio of a spectral flatness measure to a spectral center (RSS), which varies little across speakers. Gender and emotion are hierarchically classified by using the proposed feature (RSS), pitch, energy, and the mel frequency cepstral coefficients. In the speaker-independent mode, the proposed systems achieve an average recognition rate of 57.2% (+/- 5.7%) at a 90% confidence level.
Pages: 317-325
Number of pages: 9
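
The abstract names the RSS feature as the ratio of a spectral flatness measure to a spectral center but does not give formulas. The sketch below is only an illustration under textbook assumptions: flatness is taken as the geometric mean of the power spectrum divided by its arithmetic mean, and the spectral center is taken as the power-weighted mean frequency (spectral centroid). The function names, the Hann window, and the small floor added to the power spectrum are all hypothetical choices and may differ from the paper's definitions.

```python
import numpy as np


def power_spectrum(frame, sample_rate):
    """Return (frequencies in Hz, power) for one Hann-windowed frame."""
    windowed = frame * np.hanning(len(frame))
    # Small floor (an assumption) keeps the logarithm below finite.
    power = np.abs(np.fft.rfft(windowed)) ** 2 + 1e-12
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    return freqs, power


def spectral_flatness(power):
    """Geometric mean over arithmetic mean of the power spectrum (0 to 1)."""
    return np.exp(np.mean(np.log(power))) / np.mean(power)


def spectral_centroid(freqs, power):
    """Power-weighted mean frequency in Hz (assumed 'spectral center')."""
    return np.sum(freqs * power) / np.sum(power)


def rss(frame, sample_rate):
    """Illustrative RSS: spectral flatness divided by spectral centroid."""
    freqs, power = power_spectrum(frame, sample_rate)
    return spectral_flatness(power) / spectral_centroid(freqs, power)
```

Under these assumptions a noise-like frame (flat spectrum) yields a larger ratio than a strongly tonal frame. A per-utterance value could be obtained by averaging `rss` over frames, though how the paper aggregates frames is not stated in the abstract.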