Determining the relevance of different aspects of formant contours to intelligibility

被引:5
作者
Amano-Kusumoto, Akiko [1 ,2 ]
Hosom, John-Paul [2 ]
Kain, Alexander [2 ]
Aronoff, Justin M. [1 ]
机构
[1] House Res Inst, Dept Human Commun Sci Devices, Los Angeles, CA 90057 USA
[2] Oregon Hlth & Sci Univ, Ctr Spoken Language Understanding CSLU, Beaverton, OR 97006 USA
基金
美国国家科学基金会;
关键词
Speech intelligibility; Vowel perception; Speech synthesis; CLEAR SPEECH; CONVERSATIONAL SPEECH; VOWEL INTELLIGIBILITY; NORMAL-HEARING; PERCEPTION; TRANSITION;
D O I
10.1016/j.specom.2013.12.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Previous studies have shown that "clear" speech, where the speaker intentionally tries to enunciate, has better intelligibility than "conversational" speech, which is produced in regular conversation. However, conversational and clear speech vary along a number of acoustic dimensions and it is unclear what aspects of clear speech lead to better intelligibility. Previously, Kain et al. (2008) showed that a combination of short-term spectra and duration was responsible for the improved intelligibility of one speaker. This study investigates subsets of specific features of short-term spectra including temporal aspects. Similar to Kain's study, hybrid stimuli were synthesized with a combination of features from clear speech and complementary features from conversational speech to determine which acoustic features cause the improved intelligibility of clear speech. Our results indicate that, although steady-state formant values of tense vowels contributed to the intelligibility of clear speech, neither the steady-state portion nor the formant transition was sufficient to yield comparable intelligibility to that of clear speech. In contrast, when the entire formant contour of conversational speech including the phoneme duration was replaced by that of clear speech, intelligibility was comparable to that of clear speech. It indicated that the combination of formant contour and duration information was relevant to the improved intelligibility of clear speech. The study provides a better understanding of the relevance of different aspects of formant contours to the improved intelligibility of clear speech. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 23 条
[1]  
Amano-Kusumoto A., 2011, SCIENCE, P1
[2]   THE EFFECT OF FORMANT TRAJECTORIES AND PHONEME DURATIONS ON VOWEL INTELLIGIBILITY [J].
Amano-Kusumoto, Akiko ;
Hosom, John-Paul .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :4677-4680
[3]  
[Anonymous], 1996, EL SOUND LEV MET
[4]   Speaking clearly for children with learning disabilities: Sentence perception in noise [J].
Bradlow, AR ;
Kraus, N ;
Hayes, E .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2003, 46 (01) :80-97
[5]   Talker differences in clear and conversational speech: Vowel intelligibility for normal-hearing listeners [J].
Ferguson, SH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04) :2365-2373
[6]   Vowel intelligibility in clear and conversational speech for normal-hearing and hearing-impaired listeners [J].
Ferguson, SH ;
Kewley-Port, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (01) :259-271
[7]   ON THE ROLE OF SPECTRAL TRANSITION FOR SPEECH-PERCEPTION [J].
FURUI, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 80 (04) :1016-1025
[8]   Acoustic-phonetic correlates of talker intelligibility for adults and children [J].
Hazan, V ;
Markham, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (05) :3108-3118
[9]  
Helfer K S, 1998, J Am Acad Audiol, V9, P234
[10]   Identification of resynthesized |hVd| utterances:: Effects of formant contour [J].
Hillenbrand, JM ;
Nearey, TM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (06) :3509-3523