Intelligibility Enhancement of Casual Speech for Reverberant Environments inspired by Clear Speech Properties

被引:0
|
作者
Koutsogiannaki, Maria [1 ]
Petkov, Petko N. [2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] Univ Crete, CSD, Multimedia Informat Lab, Iraklion, Greece
[2] Toshiba Res Europe Ltd, Cambridge Res Lab, Kawasaki, Kanagawa, Japan
关键词
Clear Speech; Casual Speech; Intelligibility; Reverberation; Spectral Transformations; Time Modifications; Pause insertion; HARD-OF-HEARING; CONVERSATIONAL SPEECH; VOWEL INTELLIGIBILITY; SPEAKING-RATE; PERCEPTION; TALKER; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Clear speech has been shown to have an intelligibility advantage over casual speech in noisy and reverberant environments. This work validates spectral and time domain modifications to increase the intelligibility of casual speech in reverberant environments by compensating particular differences between the two speaking styles. To compensate spectral differences, a frequency-domain filtering approach is applied to casual speech. In time domain, two techniques for time-scaling casual speech are explored: (1) uniform time-scaling and (2) pause insertion and phoneme elongation based on loudness and modulation criteria. The effect of the proposed modifications is evaluated through subjective listening tests in two reverberant conditions with reverberation time 0.8s and 2s. The combination of spectral transformation and uniform time-scaling is shown to be the most successful in increasing the intelligibility of casual speech. The evaluation results support the conclusion that modifications inspired by clear speech can be beneficial for the intelligibility enhancement of speech in reverberant environments.
引用
收藏
页码:65 / 69
页数:5
相关论文
共 50 条
  • [21] The effect of nearby maskers on speech intelligibility in reverberant, multi-talker environments
    Westermann, Adam
    Buchholz, Joerg M.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (03): : 2214 - 2223
  • [22] SESNet: A Speech Enhancement and Separation Network in Noisy Reverberant Environments
    Wang, Liusong
    Gao, Yuan
    Cao, Kaimin
    Hu, Ying
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 44 - 54
  • [23] USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS
    Chu, Kevin
    Collins, Leslie
    Mainsah, Boyla
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6929 - 6933
  • [24] Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1420 - 1433
  • [25] Enhancement methods for reverberant speech
    Cole, D
    Moody, M
    Sridharan, S
    ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 383 - 386
  • [26] A systematic study of DNN based speech enhancement in reverberant and reverberant-noisy environments
    Wang, Heming
    Pandey, Ashutosh
    Wang, Deliang
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [27] On Improvement to Non-Reference Speech Intelligibility Estimation Accuracy for Reverberant Speech
    Nakazawa K.
    Kondo K.
    IEEJ Transactions on Electronics, Information and Systems, 2023, 143 (08) : 830 - 841
  • [28] Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept
    Van Ngo, Thuan
    Ho, Tuan Vu
    Unoki, Masashi
    Kubo, Rieko
    Akagi, Masato
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 753 - 758
  • [29] Enhancement of speech intelligibility in reverberant rooms: Role of amplitude envelope and temporal fine structure
    Srinivasan, Nirmal Kumar
    Zahorik, Pavel
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06): : EL239 - EL245
  • [30] Model based feature enhancement for automatic speech recognition in reverberant environments
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1239 - 1242