The Use of Voice Cues for Speaker Gender Recognition in Cochlear Implant Recipients

被引:32
作者
Meister, Hartmut [1 ]
Fuersen, Katrin [1 ,2 ,3 ]
Streicher, Barbara [2 ,3 ]
Lang-Roth, Ruth [2 ,3 ]
Walger, Martin [1 ,2 ,3 ]
机构
[1] Univ Cologne, Jean Uhrmacher Inst Clin ENT Res, Cologne, Germany
[2] Univ Cologne, Clin Otorhinolaryngol Head & Neck Surg, Cologne, Germany
[3] Cochlear Implant Ctr Cologne, Cologne, Germany
来源
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH | 2016年 / 59卷 / 03期
关键词
NORMAL-HEARING LISTENERS; TEMPORAL CUES; FUNDAMENTAL-FREQUENCY; SPEECH-PERCEPTION; SIMULTANEOUS TALKERS; MUSIC PERCEPTION; WORD RECOGNITION; VOCAL-TRACT; IDENTIFICATION; FEMALE;
D O I
10.1044/2015_JSLHR-H-15-0128
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Purpose: The focus of this study was to examine the influence of fundamental frequency (F0) and vocal tract length (VTL) modifications on speaker gender recognition in cochlear implant (CI) recipients for different stimulus types. Method: Single words and sentences were manipulated using isolated or combined F0 and VTL cues. Using an 11-point rating scale, CI recipients and listeners with normal hearing rated the maleness/femaleness of the corresponding voice. Results: Speaker gender ratings for combined F0 and VTL modifications were similar across all stimulus types in both CI recipients and listeners with normal hearing, although the CI recipients showed a somewhat larger ambiguity. In contrast to listeners with normal hearing, F0-VTL and F0-only modifications revealed similar ratings in the CI recipients when using words as stimuli. However, when sentences were used, a difference was found between F0-VTL-based and F0-based ratings. Modifying VTL cues alone did not affect ratings in the CI group. Conclusions: Whereas speaker gender ratings by listeners with normal hearing relied on combined VTL and F0 cues, CI recipients made only limited use of VTL cues, which might be one reason behind problems with identifying the speaker on the basis of voice. However, use of the voice cues depended on stimulus type, with the greater information in sentences allowing a more detailed analysis than single words in both listener groups.
引用
收藏
页码:546 / 556
页数:11
相关论文
共 52 条
[1]   Speech and music perception with the new fine structure speech coding strategy: preliminary results [J].
Arnoldner, Christoph ;
Riss, Dominik ;
Brunner, Markus ;
Durisin, Martin ;
Baumgartner, Wolf-Dieter ;
Hamzavi, Jafar-Sasan .
ACTA OTO-LARYNGOLOGICA, 2007, 127 (12) :1298-1303
[2]   Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech [J].
Bachorowski, JA ;
Owren, MJ .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (02) :1054-1063
[3]  
Boersma P., 2018, Praat: doing phonetics by computer (Version 5.3) Computer software
[4]   What breaks a melody: Perceiving FO and intensity sequences with a cochlear implant [J].
Cousineau, Marion ;
Demany, Laurent ;
Meyer, Bernard ;
Pressnitzer, Daniel .
HEARING RESEARCH, 2010, 269 (1-2) :34-41
[5]   Comparison of Bimodal and Bilateral Cochlear Implant Users on Speech Recognition With Competing Talker, Music Perception, Affective Prosody Discrimination, and Talker Identification [J].
Cullington, Helen E. ;
Zeng, Fan-Gang .
EAR AND HEARING, 2011, 32 (01) :16-30
[6]   Effects of fundamental frequency and vocal-tract length changes on attention to one of two simultaneous talkers [J].
Darwin, CJ ;
Brungart, DS ;
Simpson, BD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 114 (05) :2913-2922
[7]   Speech perception and talker segregation: Effects of level, pitch, and tactile support with multiple simultaneous talkers [J].
Drullman, R ;
Bronkhorst, AW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (05) :3090-3098
[8]   LARYNGEAL MECHANISMS AND FEATURES - INTRODUCTORY-REMARKS [J].
FANT, G .
PHONETICA, 1977, 34 (04) :252-255
[9]   Morphology and development of the human vocal tract: A study using magnetic resonance imaging [J].
Fitch, WT ;
Giedd, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (03) :1511-1522
[10]  
Fu Qian-Jie, 2007, Trends Amplif, V11, P193, DOI 10.1177/1084713807301379