Enhancement of speech-in-noise comprehension through vibrotactile stimulation at the syllabic rate

被引:10
作者
Guilleminot, Pierre [1 ]
Reichenbach, Tobias [1 ,2 ]
机构
[1] Imperial Coll London, Dept Bioengn, South Kensington Campus, London SW7 2BX, England
[2] Friedrich Alexander Univ Erlangen Nurnberg, Dept Artificial Intelligence Biomed Engn, D-91052 Erlangen, Germany
基金
英国工程与自然科学研究理事会;
关键词
audiotactile integration; speech-in-noise comprehension; multisensory processing; EEG; AUDITORY ASSOCIATION CORTEX; FALSE DISCOVERY RATE; MULTISENSORY INTEGRATION; TACTILE INTEGRATION; WORD RECOGNITION; PERCEPTION; HEARING; INTELLIGIBILITY; OSCILLATIONS; ORGANIZATION;
D O I
10.1073/pnas.2117000119
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Speech unfolds over distinct temporal scales, in particular, those related to the rhythm of phonemes, syllables, and words. When a person listens to continuous speech, the syllabic rhythm is tracked by neural activity in the theta frequency range. The tracking plays a functional role in speech processing: Influencing the theta activity through transcranial current stimulation, for instance, can impact speech perception. The theta-band activity in the auditory cortex can also be modulated through the somatosensory system, but the effect on speech processing has remained unclear. Here, we show that vibrotactile feedback presented at the rate of syllables can modulate and, in fact, enhance the comprehension of a speech signal in background noise. The enhancement occurs when vibrotactile pulses occur at the perceptual center of the syllables, whereas a temporal delay between the vibrotactile signals and the speech stream can lead to a lower level of speech comprehension. We further investigate the neural mechanisms underlying the audiotactile integration through electroencephalographic (EEG) recordings. We find that the audiotactile stimulation modulates the neural response to the speech rhythm, as well as the neural response to the vibrotactile pulses. The modulations of these neural activities reflect the behavioral effects on speech comprehension. Moreover, we demonstrate that speech comprehension can be predicted by particular aspects of the neural responses. Our results evidence a role of vibrotactile information for speech processing and may have applications in future auditory prosthesis.
引用
收藏
页数:10
相关论文
共 66 条
[1]   High visual resolution matters in audiovisual speech perception, but only for some [J].
Alsius, Agnes ;
Wayne, Rachel V. ;
Pare, Martin ;
Munhall, Kevin G. .
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2016, 78 (05) :1472-1487
[2]  
Amemiya T., 1983, HDB ECONOMETRICS, V1, P333, DOI [10.1016/S1573-4412(83)01010-7, DOI 10.1016/S1573-4412(83)01010-7]
[3]  
[Anonymous], 2007, Speech Enhancement: Theory and Practice
[4]  
Armstrong M, 1997, AM J OTOL, V18, pS140
[5]   CONTROLLING THE FALSE DISCOVERY RATE VIA KNOCKOFFS [J].
Barber, Rina Foygel ;
Candes, Emmanuel J. .
ANNALS OF STATISTICS, 2015, 43 (05) :2055-2085
[6]  
Beck D L., 2018, Journal of Otolaryngology-ENT Research, V10, P00303
[7]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[8]   The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences [J].
Benoit, C ;
Grice, M ;
Hazan, V .
SPEECH COMMUNICATION, 1996, 18 (04) :381-392
[9]  
Bird S, 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[10]   Semantic Context Enhances the Early Auditory Encoding of Natural Speech [J].
Broderick, Michael P. ;
Anderson, Andrew J. ;
Lalor, Edmund C. .
JOURNAL OF NEUROSCIENCE, 2019, 39 (38) :7564-7575