How to find trouble in communication

Cited by: 139
Authors
Batliner, A
Fischer, K
Huber, R
Spilker, J
Nöth, E
Affiliations
[1] Univ Erlangen Nurnberg, Lehrstuhl Mustererkennung Informat 5, D-91058 Erlangen, Germany
[2] Univ Bremen, Fachbereich 10, Sprach & Literaturwissensch, D-28334 Bremen, Germany
Keywords
emotion; dialogue; prosody; annotation; automatic classification; spontaneous speech; neural networks
DOI
10.1016/S0167-6393(02)00079-1
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Automatic dialogue systems used, for instance, in call centers should be able to determine, in a critical phase of the dialogue (indicated by the customer's vocal expression of anger/irritation), when it is better to pass over to a human operator. At first glance, this does not seem to be a complicated task: it is reported in the literature that emotions can be told apart quite reliably on the basis of prosodic features. However, these results are achieved most of the time in a laboratory setting, with experienced speakers (actors), and with elicited, controlled speech. We compare classification results obtained with the same feature set for elicited speech and for a Wizard-of-Oz scenario, where users believe that they are really communicating with an automatic dialogue system. It turns out that the closer we get to a realistic scenario, the less reliable prosody is as an indicator of the speakers' emotional state. As a consequence, we propose to change the target: we cease looking for traces of particular emotions in the users' speech and instead look for indicators of TROUBLE IN COMMUNICATION. For this reason, we propose the module Monitoring of User State [especially of] Emotion (MOUSE), in which a prosodic classifier is combined with other knowledge sources, such as conversationally peculiar linguistic behavior, for example, the use of repetitions. For this module, preliminary experimental results are reported showing a more adequate modelling of TROUBLE IN COMMUNICATION. © 2002 Elsevier Science B.V. All rights reserved.
Pages: 117-143
Page count: 27