Intelligibility Enhancement of Casual Speech for Reverberant Environments inspired by Clear Speech Properties

被引:0
|
作者
Koutsogiannaki, Maria [1 ]
Petkov, Petko N. [2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] Univ Crete, CSD, Multimedia Informat Lab, Iraklion, Greece
[2] Toshiba Res Europe Ltd, Cambridge Res Lab, Kawasaki, Kanagawa, Japan
来源
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年
关键词
Clear Speech; Casual Speech; Intelligibility; Reverberation; Spectral Transformations; Time Modifications; Pause insertion; HARD-OF-HEARING; CONVERSATIONAL SPEECH; VOWEL INTELLIGIBILITY; SPEAKING-RATE; PERCEPTION; TALKER; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Clear speech has been shown to have an intelligibility advantage over casual speech in noisy and reverberant environments. This work validates spectral and time domain modifications to increase the intelligibility of casual speech in reverberant environments by compensating particular differences between the two speaking styles. To compensate spectral differences, a frequency-domain filtering approach is applied to casual speech. In time domain, two techniques for time-scaling casual speech are explored: (1) uniform time-scaling and (2) pause insertion and phoneme elongation based on loudness and modulation criteria. The effect of the proposed modifications is evaluated through subjective listening tests in two reverberant conditions with reverberation time 0.8s and 2s. The combination of spectral transformation and uniform time-scaling is shown to be the most successful in increasing the intelligibility of casual speech. The evaluation results support the conclusion that modifications inspired by clear speech can be beneficial for the intelligibility enhancement of speech in reverberant environments.
引用
收藏
页码:65 / 69
页数:5
相关论文
共 50 条
  • [1] SIMPLE AND ARTEFACT-FREE SPECTRAL MODIFICATIONS FOR ENHANCING THE INTELLIGIBILITY OF CASUAL SPEECH
    Koutsogiannaki, Maria
    Stylianou, Yannis
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] Can modified casual speech reach the intelligibility of clear speech?
    Koutsogiannaki, M.
    Pettinato, M.
    Mayo, C.
    Kandia, V.
    Stylianou, Y.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 578 - 581
  • [3] Intelligibility of Clear Speech: Effect of Instruction
    Lam, Jennifer
    Tjaden, Kris
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2013, 56 (05): : 1429 - 1440
  • [4] Effects of urgent speech and preceding sounds on speech intelligibility in noisy and reverberant environments
    Hodoshima, Nao
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1696 - 1699
  • [5] Intelligibility and Acoustic Characteristics of Clear and Conversational Speech in Telugu (A South Indian Dravidian Language)
    Durisala, Naresh
    Prakash, S. G. R.
    Nambi, Arivudai
    Batra, Ridhima
    INDIAN JOURNAL OF OTOLARYNGOLOGY AND HEAD & NECK SURGERY, 2011, 63 (02) : 165 - 171
  • [6] Chinese speech intelligibility of children in noisy and reverberant environments
    Peng, Jianxin
    Wu, Shengju
    INDOOR AND BUILT ENVIRONMENT, 2018, 27 (10) : 1357 - 1363
  • [7] Korean Clear Speech Improves Speech Intelligibility for Individuals with Normal Hearing and Individuals with Hearing Loss
    Shin, Su Yeon
    Oh, Hongyeop
    Jin, In-Ki
    JOURNAL OF THE AMERICAN ACADEMY OF AUDIOLOGY, 2020, 31 (10) : 719 - 724
  • [8] Speech Intelligibility of Microphone Arrays in Reverberant Environments with Interference
    Ideli, Elham
    Vaughan, Rodney G.
    Bajic, Ivan, V
    2018 IEEE 20TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2018,
  • [9] Intelligibility of speech spoken in noise/reverberation for older adults in reverberant environments
    Hodoshima, Nao
    Arai, Takayuki
    Kurisu, Kiyohiro
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1462 - 1465
  • [10] Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments
    Kusumoto, A
    Arai, T
    Kinoshita, K
    Hodoshima, N
    Vaughan, N
    SPEECH COMMUNICATION, 2005, 45 (02) : 101 - 113