The Jena Audiovisual Stimuli of Morphed Emotional Pseudospeech (JAVMEPS): A database for emotional auditory-only, visual-only, and congruent and incongruent audiovisual voice and dynamic face stimuli with varying voice intensities

Cited by: 0
Authors
von Eiff, Celina I. [1 ,2 ,3 ,4 ]
Kauk, Julian [1 ]
Schweinberger, Stefan R. [1 ,2 ,3 ,4 ]
Affiliations
[1] Friedrich Schiller Univ Jena, Inst Psychol, Dept Gen Psychol & Cognit Neuroscience, Am Steiger 3, D-07743 Jena, Germany
[2] Friedrich Schiller Univ Jena, Inst Psychol, Voice Res Unit, Leutragraben 1, D-07743 Jena, Germany
[3] DFG SPP 2392 Visual Commun ViCom, Frankfurt, Germany
[4] Jena Univ Hosp, Jena, Germany
Keywords
Emotion; Audiovisual integration; Voice morphing; Stimulus database; Adaptive testing; Emotion induction; Cochlear implant; FACIAL EXPRESSIONS; NONVERBAL DIALECTS; PERCEPTION; RECOGNITION; INTEGRATION; SPECTRUM; SET; REPRESENTATION; APERIODICITY; INFORMATION;
DOI
10.3758/s13428-023-02249-4
Chinese Library Classification number
B841 [Research methods in psychology]
Discipline classification code
040201
Abstract
We describe JAVMEPS, an audiovisual (AV) database for emotional voice and dynamic face stimuli, with voices varying in emotional intensity. JAVMEPS includes 2256 stimulus files comprising (A) recordings of 12 speakers, speaking four bisyllabic pseudowords with six naturalistically induced basic emotions plus neutral, in auditory-only, visual-only, and congruent AV conditions. It furthermore comprises (B) caricatures (140%), original voices (100%), and anti-caricatures (60%) for happy, fearful, angry, sad, disgusted, and surprised voices for eight speakers and two pseudowords. Crucially, JAVMEPS contains (C) precisely time-synchronized congruent and incongruent AV (and corresponding auditory-only) stimuli with two emotions (anger, surprise), (C1) with original intensity (ten speakers, four pseudowords) and (C2) with graded AV congruence (implemented via five voice morph levels, from caricatures to anti-caricatures; eight speakers, two pseudowords). We collected classification data for Stimulus Set A from 22 normal-hearing listeners and four cochlear implant (CI) users, for two pseudowords, in auditory-only, visual-only, and AV conditions. Normal-hearing individuals showed good classification performance (M_corrAV = .59 to .92), with classification rates in the auditory-only condition ≥ .38 correct (surprise: .67, anger: .51). Despite compromised vocal emotion perception, CI users performed above the chance level of .14 for auditory-only stimuli, with best rates for surprise (.31) and anger (.30). We anticipate that JAVMEPS will become a useful open resource for researchers studying auditory emotion perception, especially when adaptive testing or calibration of task difficulty is desirable. With its time-synchronized congruent and incongruent stimuli, JAVMEPS can also help fill a gap in research on dynamic audiovisual integration in emotion perception, using behavioral or neurophysiological recordings.
Pages: 5103-5115
Number of pages: 13