Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

被引:2
|
作者
Poluboina, Venkateswarlu [1 ]
Pulikala, Aparna [1 ]
Muthu, Arivudai Nambi Pitchai [2 ]
机构
[1] Natl Inst Technol Karnataka, Dept Elect & Commun, Mangalore 575025, Karnataka, India
[2] Dept Audiol & Speech Language Pathol, Mangalore 575001, Karnataka, India
关键词
Cochlear implant signal processing; Temporal fine structure; Proportional frequency compression; Vocoder simulation; Speech recognition; PERFORMANCE; HEARING; ENCODER; PITCH;
D O I
10.1016/j.apacoust.2021.108616
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The study investigated the effect of proportionally frequency compressed encoding of temporal fine structure information on speech perception in noise using vocoder simulations of cochlear implant signal processing. The study proposed a pitch synchronous overlap-add algorithm (PSOLA) for downward frequency shifting of TFS. The speech recognition scores (SRS) were measured at-10 dB, 0 dB, and +10 dB for eight signal processing conditions corresponding to sinewave vocoder without TFS (NOTFS), four unshifted TFS conditions including full band TFS, TFS up to 2000, 1000, and 600 Hz, and three conditions with PSOLA which shifted 2000, 1000 and 600 Hz TFS to 1000, 500 and 300 Hz respectively. The original envelope was unchanged across the conditions. SRS at +10 dB and-10 dB SNR reached ceiling and floor respectively, in most conditions. Hence, SRS at 0 dB SNR was compared across the conditions. The results showed that the SRS was highest with full band TFS and lowest for the NO-TFS condition.The SRS for TFS 600 Hz shifted to 300 Hz through PSOLA was higher than the NO-TFS condition. Study findings suggest that encoding TFS by proportional frequency compression results in better speech perception in noise compared to NO-TFS. An important observation of this current study is that the speech recognition was better than the sine wave vocoder for all TFS conditions including frequency compressed 600 Hz TFS.(c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:5
相关论文
共 48 条
  • [21] Contribution of low-frequency acoustic information to Chinese speech recognition in cochlear implant simulations
    Luo, Xin
    Fu, Qian-Jie
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (04): : 2260 - 2266
  • [22] Contribution of low-frequency acoustic information to Chinese speech recognition in cochlear implant simulations
    Luo, Xin
    Fu, Qian-Jie
    Journal of the Acoustical Society of America, 2006, 120 (04): : 2260 - 2266
  • [23] Better Speech Recognition in Noise with the Fine Structure Processing Coding Strategy
    Vermeire, Katrien
    Punte, Andrea Kleine
    Van de Heyning, Paul
    ORL-JOURNAL FOR OTO-RHINO-LARYNGOLOGY AND ITS RELATED SPECIALTIES, 2010, 72 (06): : 305 - 311
  • [24] Speech Recognition and Temporal Amplitude Modulation Processing by Mandarin-Speaking Cochlear Implant Users
    Luo, Xin
    Fu, Qian-Jie
    Wei, Chao-Gang
    Cao, Ke-Li
    EAR AND HEARING, 2008, 29 (06): : 957 - 970
  • [25] Contribution of Temporal Fine Structure Cues to Concurrent Vowel Identification and Perception of Zebra Speech
    Serrao, Delora Samantha
    Theruvan, Nikhitha
    Fathima, Hasna
    Pitchaimuthu, Arivudai Nambi
    INTERNATIONAL ARCHIVES OF OTORHINOLARYNGOLOGY, 2024, 28 (03) : e492 - e501
  • [26] Spatial Hearing by Bilateral Cochlear Implant Users With Temporal Fine-Structure Processing
    Ausili, Sebastian A.
    Agterberg, Martijn J. H.
    Engel, Andreas
    Voelter, Christiane
    Thomas, Jan Peter
    Brill, Stefan
    Snik, Ad F. M.
    Dazert, Stefan
    Van Opstal, A. John
    Mylanus, Emmanuel A. M.
    FRONTIERS IN NEUROLOGY, 2020, 11
  • [27] Within- and across-frequency temporal processing and speech perception in cochlear implant users
    Blankenship, Chelsea M.
    Meinzen-Derr, Jareen
    Zhang, Fawen
    PLOS ONE, 2022, 17 (10):
  • [28] Contribution of Verbal Learning & Memory and Spectro-Temporal Discrimination to Speech Recognition in Cochlear Implant Users
    Harris, Michael S.
    Hamel, Benjamin L.
    Wichert, Kristin
    Kozlowski, Kristin
    Mleziva, Sarah
    Ray, Christin
    Pisoni, David B.
    Kronenberger, William G.
    Moberly, Aaron C.
    LARYNGOSCOPE, 2023, 133 (03): : 661 - 669
  • [29] Dual-carrier processing to convey temporal fine structure cues: Implications for cochlear implants
    Apoux, Frederic
    Youngdahl, Carla L.
    Yoho, Sarah E.
    Healy, Eric W.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 138 (03): : 1469 - 1480
  • [30] Temporal fine structure frequency bands criticality in perception of the speech in the presence of noise
    Yellamsetty, Anusha
    INDIAN JOURNAL OF OTOLOGY, 2016, 22 (02) : 92 - 99