Preservation of Speech Spectral Dynamics Enhances Intelligibility

Authors
Petkov, Petko N. [1]
Kleijn, W. Bastiaan [1, 2]
Affiliations
[1] KTH Royal Inst Technol, Sch Elect Engn, Stockholm, Sweden
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Source
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013
Keywords
speech intelligibility; spectral dynamics
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a method for the enhancement of intelligibility in scenarios where speech is rendered in a noisy environment. The method is based on the hypothesis that intelligibility is a monotonic function of the degree of preservation of the speech spectral dynamics. The accuracy of the speech spectral dynamics can then be traded against the power of the rendered speech signal. We can either maximize the dynamics accuracy given the signal power, or minimize the signal power given the dynamics accuracy. In our implementation, the spectral dynamics is quantified as the difference of the mel cepstra between time frames of the speech signal. We compared the speech rendered by our implementation against both natural speech and a reference method, for the scenario where signal power is minimized given a target dynamics accuracy, and observed a significantly improved intelligibility. The low system delay, and the low complexity and memory requirements make the new method particularly suitable for real-time applications.
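The abstract's definition of spectral dynamics (the difference of the mel cepstra between time frames) can be illustrated with a minimal sketch. It assumes the mel cepstra have already been computed and are given as one list of coefficients per frame. The function names and the mean-squared-error distortion measure are hypothetical choices for illustration, not the authors' formulation.

```python
from typing import List, Sequence

Frame = Sequence[float]


def cepstral_deltas(cepstra: Sequence[Frame]) -> List[List[float]]:
    """Frame-to-frame differences: delta[t] = c[t + 1] - c[t]."""
    return [
        [curr_k - prev_k for prev_k, curr_k in zip(prev, curr)]
        for prev, curr in zip(cepstra, cepstra[1:])
    ]


def dynamics_distortion(reference: Sequence[Frame],
                        rendered: Sequence[Frame]) -> float:
    """Mean squared difference between the two delta sequences.

    The MSE is an assumed stand-in for a "dynamics accuracy" measure;
    the abstract does not specify the measure used in the paper.
    """
    ref_d = cepstral_deltas(reference)
    ren_d = cepstral_deltas(rendered)
    if not ref_d or len(ref_d) != len(ren_d):
        raise ValueError("need two sequences with the same frame count (>= 2)")
    total, count = 0.0, 0
    for ref_frame, ren_frame in zip(ref_d, ren_d):
        for a, b in zip(ref_frame, ren_frame):
            total += (a - b) ** 2
            count += 1
    return total / count


# Toy mel cepstra: 3 frames, 2 coefficients each (made-up values).
reference = [[0.0, 1.0], [1.0, 3.0], [3.0, 2.0]]
shifted = [[c + 0.5 for c in frame] for frame in reference]
compressed = [[0.5 * c for c in frame] for frame in reference]

print(cepstral_deltas(reference))
print(dynamics_distortion(reference, shifted))
print(dynamics_distortion(reference, compressed))
```

In this toy setup a constant offset added to every frame leaves the deltas unchanged, so its distortion is zero, while scaling the cepstra by one half flattens the dynamics and yields a nonzero distortion.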
Pages: 3564-3568
Page count: 5