Preservation of Speech Spectral Dynamics Enhances Intelligibility

Cited by: 0
Authors
Petkov, Petko N. [1]
Kleijn, W. Bastiaan [1, 2]
Affiliations
[1] KTH Royal Inst Technol, Sch Elect Engn, Stockholm, Sweden
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Source
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013
Keywords
speech intelligibility; spectral dynamics
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a method for the enhancement of intelligibility in scenarios where speech is rendered in a noisy environment. The method is based on the hypothesis that intelligibility is a monotonic function of the degree of preservation of the speech spectral dynamics. The accuracy of the speech spectral dynamics can then be traded against the power of the rendered speech signal. We can either maximize the dynamics accuracy given the signal power, or minimize the signal power given the dynamics accuracy. In our implementation, the spectral dynamics are quantified as the difference of the mel cepstra between time frames of the speech signal. We compared the speech rendered by our implementation against both natural speech and a reference method, for the scenario where signal power is minimized given a target dynamics accuracy, and observed significantly improved intelligibility. The low system delay and the low complexity and memory requirements make the new method particularly suitable for real-time applications.
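The abstract describes spectral dynamics as the difference of the mel cepstra between time frames. The following is a minimal sketch of that idea only, assuming the mel cepstra have already been computed and are supplied as one list of coefficients per frame. The function names `cepstral_deltas` and `dynamics_distortion` are hypothetical, and the mean squared error used to compare two delta sequences is an illustrative assumption, not the accuracy measure from the paper.

```python
from typing import List, Sequence

Frame = Sequence[float]


def cepstral_deltas(cepstra: Sequence[Frame]) -> List[List[float]]:
    """Return the frame-to-frame differences c[t+1] - c[t], per coefficient."""
    return [
        [nxt_c - prev_c for prev_c, nxt_c in zip(prev, nxt)]
        for prev, nxt in zip(cepstra, cepstra[1:])
    ]


def dynamics_distortion(reference: Sequence[Frame], rendered: Sequence[Frame]) -> float:
    """Mean squared error between the delta sequences of two utterances.

    Assumption: squared error is chosen here for illustration only; the
    abstract does not state which distortion measure the authors use.
    """
    ref_d = cepstral_deltas(reference)
    ren_d = cepstral_deltas(rendered)
    if len(ref_d) != len(ren_d):
        raise ValueError("inputs must have the same number of frames")
    total = 0.0
    count = 0
    for ref_frame, ren_frame in zip(ref_d, ren_d):
        for x, y in zip(ref_frame, ren_frame):
            total += (x - y) ** 2
            count += 1
    # Fewer than two frames means there are no dynamics to compare.
    return total / count if count else 0.0


# Toy mel cepstra: three frames, two coefficients each (made-up values).
reference = [[0.0, 1.0], [1.0, 1.0], [3.0, 0.0]]
# A constant per-coefficient offset leaves the frame differences unchanged.
shifted = [[c0 + 5.0, c1 - 2.0] for c0, c1 in reference]
# A signal frozen at the first frame has lost all spectral dynamics.
flattened = [list(reference[0]) for _ in reference]

print(dynamics_distortion(reference, shifted))
print(dynamics_distortion(reference, flattened))
```

In this toy setup a static offset of the cepstra costs nothing in dynamics accuracy, while flattening the trajectory is penalized, which is the kind of quantity that the abstract says can be traded against the power of the rendered signal.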
Pages: 3564-3568
Page count: 5