Impact of Background Noise and Contribution of Visual Information in Emotion Identification by Native Mandarin Speakers

被引:1
作者
Zhang, Minyue [1 ]
Ding, Hongwei [1 ]
机构
[1] Shanghai Jiao Tong Univ, Speech Language Hearing Ctr, Sch Foreign Languages, Shanghai, Peoples R China
来源
INTERSPEECH 2022 | 2022年
关键词
multisensory integration; emotion perception; babble noise; audiovisual; AUDIOVISUAL INTEGRATION; SPEECH; PERCEPTION; VOICE; RECOGNITION; TONE;
D O I
10.21437/Interspeech.2022-10142
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many studies on emotion processing considered little about the issue of ecological validity and insufficient attention has been drawn to uni-sensory and multisensory emotion perception in challenging environments. The current research explored how adding multi-talker babble noise impacts emotion perception and how visual information affects the results in comparison with the audio-alone conditions. Forty native Mandarin participants (21 females and 19 males) were asked to identify the emotion according to the auditory or audiovisual information they received. Results showed that the emotion identification accuracy was significantly lower in noisy conditions than in noiseless ones, whether additional visual information was presented simultaneously or not. In noisy environments, providing multisensory emotional information greatly facilitated recognition performances even when the visual information was less reliable. To conclude, multi-talker babble noise had a corrupting effect on emotion identification, which worked in both unisensory and multisensory settings, and emotion perception is a robust multisensory situation that follows the inverse effectiveness principle.
引用
收藏
页码:1993 / 1997
页数:5
相关论文
共 32 条
[1]   INTERJECTIONS - THE UNIVERSAL YET NEGLECTED PART-OF-SPEECH - INTRODUCTION [J].
AMEKA, F .
JOURNAL OF PRAGMATICS, 1992, 18 (2-3) :101-118
[2]  
BENALI AR, 2021, IEEE T AFFECT COMPUT, DOI DOI 10.1002/SAJ2.20281
[3]   Investigating Multisensory Integration in Emotion Recognition Through Bio-Inspired Computational Models [J].
Benssassi, Esma Mansouri ;
Ye, Juan .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) :906-918
[4]  
Chen F, 2015, EAR HEARING, V36, P61, DOI 10.1097/AUD.0000000000000074
[5]   Cultural Experience Influences Multisensory Emotion Perception in Bilinguals [J].
Chen, Peiyao ;
Chung-Fat-Yim, Ashley ;
Marian, Viorica .
LANGUAGES, 2022, 7 (01)
[6]   Robust emotion recognition by spectro-temporal modulation statistic features [J].
Chi, Tai-Shih ;
Yeh, Lan-Ying ;
Hsu, Chin-Cheng .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2012, 3 (01) :47-60
[7]   Audio-visual integration of emotion expression [J].
Collignon, Olivier ;
Girard, Simon ;
Gosselin, Frederic ;
Roy, Sylvain ;
Saint-Amour, Dave ;
Lassonde, Maryse ;
Lepore, Franco .
BRAIN RESEARCH, 2008, 1242 :126-135
[8]   Degraded visual and auditory input individually impair audiovisual emotion recognition from speech-like stimuli, but no evidence for an exacerbated effect from combined degradation [J].
de Boer, Minke J. ;
Juergens, Tim ;
Cornelissen, Frans W. ;
Baskent, Deniz .
VISION RESEARCH, 2021, 180 :51-62
[9]   The perception of emotions by ear and by eye [J].
de Gelder, B ;
Vroomen, J .
COGNITION & EMOTION, 2000, 14 (03) :289-311
[10]   A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets [J].
Ghanbari, Yasser ;
Karami-Mollaei, Mohammad Reza .
SPEECH COMMUNICATION, 2006, 48 (08) :927-940