Emotion Recognition using Imperfect Speech Recognition

Citations: 0
Authors
Metze, Florian [1]
Batliner, Anton [4]
Eyben, Florian [2]
Polzehl, Tim [3]
Schuller, Bjoern [2]
Steidl, Stefan [4]
Affiliations
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Tech Univ Munich, Inst Human Machine Commun, Munich, Germany
[3] Tech Univ Berlin, Qual & Usabil Lab, Berlin, Germany
[4] Friedrich Alexander Univ, Lehrstuhl Mustererkennung, Erlangen, Germany
Keywords
speech-to-text; emotion detection; meta-data extraction; rich transcription; children's speech
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the use of speech-to-text methods for assigning an emotion class to a given speech utterance. Previous work shows that emotion extracted from text can provide evidence complementary to the information extracted by classifiers based on spectral or other non-linguistic features. Because speech-to-text usually requires significantly more computational effort, this study investigates the degree of speech-to-text accuracy needed for reliable detection of emotions from an automatically generated transcription of an utterance. We evaluate the use of hypotheses in both training and testing, and compare several classification approaches on the same task. Our results show that emotion recognition performance stays roughly constant as long as word accuracy does not fall below a reasonable value, making the use of speech-to-text viable for training emotion classifiers based on linguistic information.
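The abstract's central quantity is word accuracy, which is not defined in this record. As a minimal sketch, assuming the conventional definition (one minus the word-level edit distance divided by the number of reference words), it could be computed as follows. The function name `word_accuracy` and the example sentences are hypothetical and are not taken from the paper.

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Word accuracy = 1 - (substitutions + deletions + insertions) / N,
    where N is the number of words in the reference transcription.
    Assumed conventional definition; the paper's exact scoring may differ."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return 1.0 - prev[-1] / len(ref)


# Hypothetical reference transcription and recognizer hypothesis.
print(word_accuracy("please turn left now", "please turn right"))
```

Under this definition the value can become negative when the hypothesis contains many insertions, so it is not strictly bounded below by zero.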
Pages: 478 / +
Page count: 2