Parameter-Specific Morphing Reveals Contributions of Timbre and Fundamental Frequency Cues to the Perception of Voice Gender and Age in Cochlear Implant Users

Cited by: 18
Authors
Skuk, Verena G. [1, 2, 3]
Kirchen, Louisa [2, 4, 5]
Oberhoffner, Tobias [3, 6]
Guntinas-Lichius, Orlando [3]
Dobel, Christian [3]
Schweinberger, Stefan R. [1, 2, 7]
Affiliations
[1] Friedrich Schiller Univ Jena, DFG Res Unit Person Percept, Jena, Germany
[2] Friedrich Schiller Univ Jena, Inst Psychol, Dept Gen Psychol & Cognit Neurosci, Jena, Germany
[3] Jena Univ Hosp, Inst Phoniatry & Pedaudiol, Dept Otorhinolaryngol, Jena, Germany
[4] Social Pediat Ctr, Trier, Germany
[5] Ctr Adults Special Needs, Trier, Germany
[6] Otto Korner Univ Med Ctr Rostock, Dept Otorhinolaryngol Head & Neck Surg, Rostock, Germany
[7] Swiss Ctr Affect Sci, Geneva, Switzerland
Source
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH | 2020 / Volume 63 / Issue 09
Keywords
QUALITY-OF-LIFE; NORMAL-HEARING LISTENERS; VOCAL-TRACT LENGTH; FORMANT FREQUENCIES; SPEAKER GENDER; TEMPORAL CUES; RECOGNITION; DISCRIMINATION; SPEECH; CHILDREN;
DOI
10.1044/2020_JSLHR-20-00026
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104; 100213
Abstract
Purpose: Using naturalistic synthesized speech, we determined the relative importance of acoustic cues in voice gender and age perception in cochlear implant (CI) users. Method: We investigated 28 CI users' abilities to utilize fundamental frequency (F0) and timbre in perceiving voice gender (Experiment 1) and vocal age (Experiment 2). Parameter-specific voice morphing was used to selectively control acoustic cues (F0; time; timbre, i.e., formant frequencies, spectral-level information, and aperiodicity, as defined in TANDEM-STRAIGHT) in voice stimuli. Individual differences in CI users' performance were quantified via deviations from the mean performance of 19 normal-hearing (NH) listeners. Results: CI users' gender perception seemed exclusively based on F0, whereas NH listeners efficiently used timbre. For age perception, timbre was more informative than F0 for both groups, with minor contributions of temporal cues. While a few CI users performed comparably to NH listeners overall, others were at chance. Separate analyses confirmed that even high-performing CI users classified gender almost exclusively based on F0. While high performers could discriminate age in male and female voices, low performers were close to chance overall but used F0 as a misleading cue to age (classifying female voices as young and male voices as old). Satisfaction with CI generally correlated with performance in age perception. Conclusions: We confirmed that CI users' gender classification is mainly based on F0. However, high performers could make reasonable use of timbre cues in age perception. Overall, parameter-specific morphing can serve to objectively assess individual profiles of CI users' abilities to perceive nonverbal social-communicative vocal signals.
Pages: 3155-3175
Page count: 21