Speech Modification for Intelligibility in Cochlear Implant Listeners: Individual Effects of Vowel- and Consonant-Boosting

被引:1
作者
Saba, Juliana N. [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, Ctr Robust Speech Syst CRSS, Cochlear Implant Proc Lab CILab, Erik Jonsson Sch Engn & Comp Sci, Richardson, TX 75083 USA
来源
INTERSPEECH 2022 | 2022年
关键词
cochlear implants; Lombard Effect; formants; signal processing; compression; enhancement; STRESSED SPEECH; LOMBARD; RECOGNITION; NOISE; SPEAKING; STRATEGY; DURATION;
D O I
10.21437/Interspeech.2022-11131
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Previous research has demonstrated techniques to improve automatic speech recognition and speech-in-noise intelligibility for normal hearing (NH) and cochlear implant (CI) listeners by synthesizing Lombard Effect (LE) speech. In this study, we emulate and evaluate segment-specific modifications based on speech production characteristics observed in natural LE speech in order to improve intelligibility for CI listeners. Two speech processing approaches were designed to modify representation of vowels, consonants, and the combination using amplitude-based compression techniques in the "electric domain" - referring to the stimulation sequence delivered to the intracochlear electrode array that corresponds to the acoustic signal. Performance with CI listeners resulted in no significant difference using consonant-boosting and consonant- and vowel-boosting strategies with better representation of mid-frequency and high-frequency content corresponding to both formant and consonant structure, respectively. Spectral smearing and decreased amplitude variation were also observed which may have negatively impacted intelligibility. Segmental perturbations using a weighted logarithmic and sigmoid compression functions in this study demonstrated the ability to improve representation of frequency content but disrupted amplitude-based cues, regardless of comparable speech intelligibility. While there are an infinite number of acoustic domain modifications characterizing LE speech, this study demonstrates a basic framework for emulating segmental differences in the electric domain.
引用
收藏
页码:5473 / 5477
页数:5
相关论文
共 35 条
[1]   HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress [J].
Bou-Ghazale, SE ;
Hansen, JHL .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03) :201-216
[2]   Generating stressed speech from neutral speech using a modified CELP vocoder [J].
BouGhazale, SE ;
Hansen, JHL .
SPEECH COMMUNICATION, 1996, 20 (1-2) :93-110
[3]   NONLINEAR-ANALYSIS AND CLASSIFICATION OF SPEECH UNDER STRESSED CONDITIONS [J].
CAIRNS, DA ;
HANSEN, JHL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (06) :3392-3400
[4]   The contribution of durational and spectral changes to the Lombard speech intelligibility benefit [J].
Cooke, Martin ;
Mayo, Catherine ;
Villegas, Julian .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (02) :874-883
[5]   Evaluating the intelligibility benefit of speech modifications in known noise conditions [J].
Cooke, Martin ;
Mayo, Catherine ;
Valentini-Botinhao, Cassia ;
Stylianou, Yannis ;
Sauert, Bastian ;
Tang, Yan .
SPEECH COMMUNICATION, 2013, 55 (04) :572-585
[6]   Perceptual contributions of the consonant-vowel boundary to sentence intelligibility [J].
Fogerty, Daniel ;
Kewley-Port, Diane .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (02) :847-857
[7]   Factors affecting predicted speech intelligibility with cochlear implants in an auditory model for electrical stimulation [J].
Fredelake, Stefan ;
Hohmann, Volker .
HEARING RESEARCH, 2012, 287 (1-2) :76-90
[8]   Temporal processing and speech recognition in cochlear implant users [J].
Fu, QJ .
NEUROREPORT, 2002, 13 (13) :1635-1639
[9]   Speaking in noise: How does the Lombard effect improve acoustic contrasts between speech and ambient noise? [J].
Garnier, Maeva ;
Henrich, Nathalie .
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (02) :580-597
[10]   CCi-MOBILE: A Portable Real Time Speech Processing Platform for Cochlear Implant and Hearing Research [J].
Ghosh, Ria ;
Ali, Hussnain ;
Hansen, John H. L. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (03) :1251-1263