Emotion Recognition using Imperfect Speech Recognition

被引:0
|
作者
Metze, Florian [1 ]
Batliner, Anton [4 ]
Eyben, Florian [2 ]
Polzehl, Tim [3 ]
Schuller, Bjoern [2 ]
Steidl, Stefan [4 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Tech Univ Munich, Inst Human Machine Commun, Munich, Germany
[3] Tech Univ Berlin, Qual & Usabil Lab, Berlin, Germany
[4] Friedrich Alexander Univ, Lehrstuhl Mustererkennung, Erlangen, Germany
关键词
speech-to-text; emotion detection; meta-data extraction; rich transcription; children's speech;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of speech-to-text methods for assigning an emotion class to a given speech utterance. Previous work shows that an emotion extracted from text can convey complementary evidence to the information extracted by classifiers based on spectral, or other non-linguistic features. As speech-to-text usually presents significantly more computational effort, in this study we investigate the degree of speech-to-text accuracy needed for reliable detection of emotions from an automatically generated transcription of an utterance. We evaluate the use of hypotheses in both training and testing, and compare several classification approaches on the same task. Our results show that emotion recognition performance stays roughly constant as long as word accuracy doesn't fall below a reasonable value, making the use of speech-to-text viable for training of emotion classifiers based on linguistics.
引用
收藏
页码:478 / +
页数:2
相关论文
共 50 条
  • [31] Emotion Prompting for Speech Emotion Recognition
    Zhou, Xingfa
    Li, Min
    Yang, Lan
    Sun, Rui
    Wang, Xin
    Zhan, Huayi
    INTERSPEECH 2023, 2023, : 3108 - 3112
  • [32] Speech emotion recognition using emotion perception spectral feature
    Jiang, Lin
    Tan, Ping
    Yang, Junfeng
    Liu, Xingbao
    Wang, Chao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (11):
  • [33] Speech Emotion Recognition Using Speech Feature and Word Embedding
    Atmaja, Bagus Tris
    Shirai, Kiyoaki
    Akagi, Masato
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 519 - 523
  • [34] Research on Emergency Parking Instruction Recognition Based on Speech Recognition and Speech Emotion Recognition
    Tian Kexin
    Huang Yongming
    Zhang Guobao
    Zhang Lin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2933 - 2937
  • [35] Emotion Recognition in Arabic Speech
    Klaylat, Samira
    Hamandi, Lama
    Osman, Ziad
    Zantout, Rached
    2017 SENSORS NETWORKS SMART AND EMERGING TECHNOLOGIES (SENSET), 2017,
  • [36] Persian Speech Emotion Recognition
    Savargiv, Mohammad
    Bastanfard, Azam
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [37] Multiroom Speech Emotion Recognition
    Shalev, Erez
    Cohen, Israel
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 135 - 139
  • [38] Emotion recognition in Arabic speech
    Samira Klaylat
    Ziad Osman
    Lama Hamandi
    Rached Zantout
    Analog Integrated Circuits and Signal Processing, 2018, 96 : 337 - 351
  • [39] Windowing for Speech Emotion Recognition
    Puterka, Boris
    Kacur, Juraj
    Pavlovicova, Jarmila
    2019 61ST INTERNATIONAL SYMPOSIUM ELMAR, 2019, : 147 - 150
  • [40] Mandarin emotion recognition in speech
    Pao, TL
    Chen, YT
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 227 - 230