Combining spectral and temporal modification techniques for speech intelligibility enhancement

Cited by: 6
Authors
Cooke, Martin [1, 2]
Aubanel, Vincent [3]
Garcia Lecumberri, Maria Luisa [2]
Affiliations
[1] Ikerbasque Basque Sci Fdn, Bilbao, Spain
[2] Univ Basque Country, Language & Speech Lab, Vitoria 01006, Spain
[3] Univ Grenoble Alpes, Ctr Natl Rech Sci, GIPSA Lab, Grenoble, France
Keywords
Speech modification; Intelligibility; Retiming; Glimpsing; COCHLEA-SCALED ENTROPY; NOISE; CLEAR; INTENSITY;
DOI
10.1016/j.csl.2018.10.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Modifying clean speech prior to output in noisy conditions can lead to substantial intelligibility gains. Most algorithms operate by redistributing energy across the signal, leaving the timing of the underlying speech sounds intact. Other techniques do alter the timing of speech relative to the masker. Both classes of approach - spectral and temporal - lead to a reduction in energetic masking. The current study examines how their combination affects intelligibility. Arguments can be made for both synergy and redundancy, and the presence of distortions introduced by both spectral and temporal approaches might even lead to an antagonistic combination. A cohort of native Spanish listeners identified keywords in sentences in unmodified form and following spectral, temporal and spectro-temporal modification, in the presence of a fluctuating masker. Errors in the spectro-temporal condition were substantially lower than following spectral or temporal modification alone, with a three-fold reduction compared to unmodified speech. Spectro-temporal gains were observed for all phonemes. A glimpse-based model of energetic masking incorporating speech rate changes predicts intelligibility (r = .96), and a glimpsing analysis provides further insights into the distinct mechanisms through which spectral and temporal approaches lead to a release from energetic masking. © 2018 Elsevier Ltd. All rights reserved.
Pages: 26-39
Page count: 14
Related papers
50 in total
  • [41] Loudness Balancing Optimization for Better Speech Intelligibility, Music Perception, and Spectral Temporal Resolution in Cochlear Implant Users
    Deniz, Burcu
    Deniz, Risvan
    Atas, Ahmet
    OTOLOGY & NEUROTOLOGY, 2024, 45 (05) : e385 - e392
  • [42] Increasing Speech Intelligibility via Spectral Shaping with Frequency Warping and Dynamic Range Compression plus Transient Enhancement
    Godoy, Elizabeth
    Stylianou, Yannis
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3539 - 3543
  • [43] Comparitive Analysis of Speech Enhancement Techniques: A Review
    Rohith, K.
    Chethan, K.
    2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 562 - 565
  • [44] Neural Decoding of the Speech Envelope: Effects of Intelligibility and Spectral Degradation
    Macintyre, Alexis Deighton
    Carlyon, Robert P.
    Goehring, Tobias
    TRENDS IN HEARING, 2024, 28
  • [45] Speech Intelligibility Enhancement using an Optimal Formant Shifting Approach
    Nathwani, Karan
    Hafiz, Faizal
    Swain, Akshya
    Biswas, Ritujoy
    PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2021), 2021, : 120 - 125
  • [46] Spectro-temporal modulation glimpsing for speech intelligibility prediction
    Edraki, Amin
    Chan, Wai-Yip
    Jensen, Jesper
    Fogerty, Daniel
    HEARING RESEARCH, 2022, 426
  • [47] COMBINING MULTIPLE KERNEL MODELS FOR AUTOMATIC INTELLIGIBILITY DETECTION OF PATHOLOGICAL SPEECH
    Huang, Dong-Yan
    Dong, Minghui
    Li, Haizhou
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6485 - 6489
  • [48] Chinese speech intelligibility of elderly people in environments combining reverberation and noise
    Zhang Honghu
    Yan Jia
    Peng Jianxin
    APPLIED ACOUSTICS, 2019, 150 : 1 - 4
  • [49] The Role of Phase-locking to the Temporal Envelope of Speech in Auditory Perception and Speech Intelligibility
    Millman, Rebecca E.
    Johnson, Sam R.
    Prendergast, Garreth
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (03) : 533 - 545
  • [50] On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Verdu, Elena
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (02) : 78 - 89