Degradation levels of continuous speech affect neural speech tracking and alpha power differently

被引:24
作者
Hauswald, Anne [1 ,2 ]
Keitel, Anne [3 ,4 ]
Chen, Ya-Ping [1 ,2 ]
Roesch, Sebastian [5 ]
Weisz, Nathan [1 ,2 ]
机构
[1] Univ Salzburg, Ctr Cognit Neurosci, Hellbrunnerstr 34, A-5020 Salzburg, Austria
[2] Univ Salzburg, Dept Psychol, Salzburg, Austria
[3] Univ Dundee, Psychol, Sch Social Sci, Dundee, Scotland
[4] Univ Glasgow, Ctr Cognit Neuroimaging, Glasgow, Lanark, Scotland
[5] Paracelsus Med Univ, Dept Otorhinolaryngol, Salzburg, Austria
基金
英国惠康基金; 奥地利科学基金会;
关键词
alpha power; continuous speech; degraded speech; low-frequency speech tracking; MEG; LISTENING EFFORT; DEGRADED SPEECH; WORKING-MEMORY; PHASE PATTERNS; ENTRAINMENT; BRAIN; INTELLIGIBILITY; COMPREHENSION; RECOGNITION; INTEGRATION;
D O I
10.1111/ejn.14912
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Making sense of a poor auditory signal can pose a challenge. Previous attempts to quantify speech intelligibility in neural terms have usually focused on one of two measures, namely low-frequency speech-brain synchronization or alpha power modulations. However, reports have been mixed concerning the modulation of these measures, an issue aggravated by the fact that they have normally been studied separately. We present two MEG studies analyzing both measures. In study 1, participants listened to unimodal auditory speech with three different levels of degradation (original, 7-channel and 3-channel vocoding). Intelligibility declined with declining clarity, but speech was still intelligible to some extent even for the lowest clarity level (3-channel vocoding). Low-frequency (1-7 Hz) speech tracking suggested a U-shaped relationship with strongest effects for the medium-degraded speech (7-channel) in bilateral auditory and left frontal regions. To follow up on this finding, we implemented three additional vocoding levels (5-channel, 2-channel and 1-channel) in a second MEG study. Using this wider range of degradation, the speech-brain synchronization showed a similar pattern as in study 1, but further showed that when speech becomes unintelligible, synchronization declines again. The relationship differed for alpha power, which continued to decrease across vocoding levels reaching a floor effect for 5-channel vocoding. Predicting subjective intelligibility based on models either combining both measures or each measure alone showed superiority of the combined model. Our findings underline that speech tracking and alpha power are modified differently by the degree of degradation of continuous speech but together contribute to the subjective speech understanding.
引用
收藏
页码:3288 / 3302
页数:15
相关论文
共 68 条
[1]   Measures of Listening Effort Are Multidimensional [J].
Alhanbali, Sara ;
Dawes, Piers ;
Millman, Rebecca E. ;
Munro, Kevin J. .
EAR AND HEARING, 2019, 40 (05) :1084-1097
[2]  
[Anonymous], 2018, Language, Cognition and Neuroscience, DOI [10.1080/23273798.2018.1518534, DOI 10.1080/23273798.2018.1518534]
[3]   Fitting Linear Mixed-Effects Models Using lme4 [J].
Bates, Douglas ;
Maechler, Martin ;
Bolker, Benjamin M. ;
Walker, Steven C. .
JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01) :1-48
[4]   Left temporal alpha-band activity reflects single word intelligibility [J].
Becker, Robert ;
Pefkou, Maria ;
Michel, Christoph M. ;
Hervais-Adelman, Alexis G. .
FRONTIERS IN SYSTEMS NEUROSCIENCE, 2013, 7
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   The psychophysics toolbox [J].
Brainard, DH .
SPATIAL VISION, 1997, 10 (04) :433-436
[7]   The Natural Statistics of Audiovisual Speech [J].
Chandrasekaran, Chandramouli ;
Trubanova, Andrea ;
Stillittano, Sebastien ;
Caplier, Alice ;
Ghazanfar, Asif A. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
[8]   Toward EEG-Assisted Hearing Aids: Objective Threshold Estimation Based on Ear-EEG in Subjects With Sensorineural Hearing Loss [J].
Christensen, Christian Bech ;
Hietkamp, Renskje K. ;
Harte, James M. ;
Lunner, Thomas ;
Kidmose, Preben .
TRENDS IN HEARING, 2018, 22
[9]   The Multivariate Temporal Response Function (mTRF) Toolbox: A MATLAB Toolbox for Relating Neural Signals to Continuous Stimuli [J].
Crosse, Michael J. ;
Di Liberto, Giovanni M. ;
Bednar, Adam ;
Lalor, Edmund C. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2016, 10
[10]   Lexical information drives; Perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences [J].
Davis, MH ;
Johnsrude, IS ;
Hervais-Adelman, A ;
Taylor, K ;
McGettigan, C .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2005, 134 (02) :222-241