The Recognition of Whispered Speech in Real-Time

被引:5
|
作者
Hendrickson, Kristi [1 ,2 ]
Ernest, Danielle [1 ]
机构
[1] Univ Iowa, Dept Commun Sci & Disorders, 250 Hawkins Dr, Iowa City, IA 52240 USA
[2] Univ Iowa, Dept Psychol & Brain Sci, 250 Hawkins Dr, Iowa City, IA 52240 USA
关键词
Competition; Eye tracking; Lexical; Speech perception; Whispered speech; Word recognition; SPOKEN-WORD RECOGNITION; PERCEIVED PITCH; PERCEPTION; INFORMATION; LANGUAGE; FEATURES; VOWELS; NOISE; MODEL;
D O I
10.1097/AUD.0000000000001114
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Objectives: Whispered speech offers a unique set of challenges to speech perception and word recognition. The goals of the present study were twofold: First, to determine how listeners recognize whispered speech. Second, to inform major theories of spoken word recognition by considering how recognition changes when major cues to phoneme identity are reduced or largely absent compared with normal voiced speech. Design: Using eye tracking in the Visual World Paradigm, we examined how listeners recognize whispered speech. After hearing a target word (normal or whispered), participants selected the corresponding image from a display of four-a target (e.g., money), a word that shares sounds with the target at the beginning (cohort competitor, e.g., mother), a word that shares sounds with the target at the end (rhyme competitor, e.g., honey), and a phonologically unrelated word (e.g., whistle). Eye movements to each object were monitored to measure (1) how fast listeners process whispered speech, and (2) how strongly they consider lexical competitors (cohorts and rhymes) as the speech signal unfolds. Results: Listeners were slower to recognize whispered words. Compared with normal speech, listeners displayed slower reaction times to click the target image, were slower to fixate the target, and fixated the target less overall. Further, we found clear evidence that the dynamics of lexical competition are altered during whispered speech recognition. Relative to normal speech, words that overlapped with the target at the beginning (cohorts) displayed slower, reduced, and delayed activation, whereas words that overlapped with the target at the end (rhymes) exhibited faster, more robust, and longer lasting activation. Conclusion: When listeners are confronted with whispered speech, they engage in a "wait-and-see" approach. Listeners delay lexical access, and by the time they begin to consider what word they are hearing, the beginning of the word has largely come and gone, and activation for cohorts is reduced. However, delays in lexical access actually increase consideration of rhyme competitors; the delay pushes lexical activation to a point later in processing, and the recognition system puts more weight on the word-final overlap between the target and the rhyme.
引用
收藏
页码:554 / 562
页数:9
相关论文
共 50 条
  • [1] Lexical Access Changes Based on Listener Needs: Real-Time Word Recognition in Continuous Speech in Cochlear Implant Users
    Smith, Francis X.
    McMurray, Bob
    EAR AND HEARING, 2022, 43 (05) : 1487 - 1501
  • [2] Analysis and recognition of whispered speech
    Ito, T
    Takeda, K
    Itakura, F
    SPEECH COMMUNICATION, 2005, 45 (02) : 139 - 152
  • [3] Study on the Emotion Recognition of Whispered Speech
    Jin, Yun
    Zhao, Yan
    Huang, Chengwei
    Zhao, Li
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 242 - 246
  • [4] Lightweight Real-Time Recurrent Models for Speech Enhancement and Automatic Speech Recognition
    Dhahbi, Sami
    Saleem, Nasir
    Gunawan, Teddy Surya
    Bourouis, Sami
    Ali, Imad
    Trigui, Aymen
    Algarni, Abeer D.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024, 8 (06): : 74 - 85
  • [5] Maturation of Speech-in-Speech Recognition for Whispered and Voiced Speech
    Buss, Emily
    Miller, Margaret K.
    Leibold, Lori J.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (08): : 3117 - 3128
  • [6] Real-Time Robust Automatic Speech Recognition Using Compact Support Vector Machines
    Solera-Urena, Ruben
    Isabel Garcia-Moral, Ana
    Pelaez-Moreno, Carmen
    Martinez-Ramon, Manel
    Diaz-de-Maria, Fernando
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1347 - 1361
  • [7] Mandarin Connected Digits Recognition for Whispered Speech
    Ru Tingting
    Xie Xiang
    Yin Hui
    Kuang Jingming
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1141 - 1144
  • [8] The Slow Developmental Time Course of Real-Time Spoken Word Recognition
    Rigler, Hannah
    Farris-Trimble, Ashley
    Greiner, Lea
    Walker, Jessica
    Tomblin, J. Bruce
    McMurray, Bob
    DEVELOPMENTAL PSYCHOLOGY, 2015, 51 (12) : 1690 - 1703
  • [9] Real-time lexical competitions during speech-in-speech comprehension
    Boulenger, Veronique
    Hoen, Michel
    Pellegrino, Francois
    Meunier, Fanny
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1839 - +
  • [10] Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination
    Irino, Toshio
    Aoki, Yoshie
    Kawahara, Hideki
    Patterson, Roy D.
    SPEECH COMMUNICATION, 2012, 54 (09) : 998 - 1013