Contributions of local speech encoding and functional connectivity to audio-visual speech perception

被引:55
作者
Giordano, Bruno L. [1 ,2 ]
Ince, Robin A. A. [2 ]
Gross, Joachim [2 ]
Schyns, Philippe G. [2 ]
Panzeri, Stefano [3 ]
Kayser, Christoph [2 ]
机构
[1] Aix Marseille Univ, CNRS, UMR 7289, Inst Neurosci Timone, Marseille, France
[2] Univ Glasgow, Inst Neurosci & Psychol, Glasgow, Lanark, Scotland
[3] Ist Italiano Tecnol, Ctr Neurosci & Cognit Syst, Neural Computat Lab, Rovereto, Italy
基金
欧洲研究理事会; 英国工程与自然科学研究理事会; 英国惠康基金; 英国生物技术与生命科学研究理事会;
关键词
MULTISENSORY INTEGRATION; AUDITORY-CORTEX; PREMOTOR CORTEX; VISUAL SPEECH; CORTICAL REPRESENTATION; NEURONAL OSCILLATIONS; RIGHT-HEMISPHERE; DEGRADED SPEECH; DYNAMIC FACES; BRAIN NETWORK;
D O I
10.7554/eLife.24763
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Seeing a speaker's face enhances speech intelligibility in adverse environments. We investigated the underlying network mechanisms by quantifying local speech representations and directed connectivity in MEG data obtained while human participants listened to speech of varying acoustic SNR and visual context. During high acoustic SNR speech encoding by temporally entrained brain activity was strong in temporal and inferior frontal cortex, while during low SNR strong entrainment emerged in premotor and superior frontal cortex. These changes in local encoding were accompanied by changes in directed connectivity along the ventral stream and the auditory-premotor axis. Importantly, the behavioral benefit arising from seeing the speaker's face was not predicted by changes in local encoding but rather by enhanced functional connectivity between temporal and inferior frontal cortex. Our results demonstrate a role of auditory-frontal interactions in visual speech representations and suggest that functional connectivity along the ventral pathway facilitates speech comprehension in multisensory environments.
引用
收藏
页数:27
相关论文
共 114 条
[21]   Phonetic perceptual identification by native- and second-language speakers differentially activates brain regions involved with acoustic phonetic processing and those involved with articulatory-auditory/orosensory internal models [J].
Callan, DE ;
Jones, JA ;
Callan, AM ;
Akahane-Yamada, R .
NEUROIMAGE, 2004, 22 (03) :1182-1194
[22]   Neural processes underlying perceptual enhancement by visual speech gestures [J].
Callan, DE ;
Jones, JA ;
Munhall, K ;
Callan, AM ;
Kroos, C ;
Vatikiotis-Bateson, E .
NEUROREPORT, 2003, 14 (17) :2213-2218
[23]   The functional role of cross-frequency coupling [J].
Canolty, Ryan T. ;
Knight, Robert T. .
TRENDS IN COGNITIVE SCIENCES, 2010, 14 (11) :506-515
[24]   Dynamic faces speed up the onset of auditory cortical spiking responses during vocal detection [J].
Chandrasekaran, Chandramouli ;
Lemus, Luis ;
Ghazanfar, Asif A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (48) :E4668-E4677
[25]   The Natural Statistics of Audiovisual Speech [J].
Chandrasekaran, Chandramouli ;
Trubanova, Andrea ;
Stillittano, Sebastien ;
Caplier, Alice ;
Ghazanfar, Asif A. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
[26]   Silent Expectations: Dynamic Causal Modeling of Cortical Prediction and Attention to Sounds That Weren't [J].
Chennu, Srivas ;
Noreika, Valdas ;
Gueorguiev, David ;
Shtyrov, Yury ;
Bekinschtein, Tristan A. ;
Henson, Richard .
JOURNAL OF NEUROSCIENCE, 2016, 36 (32) :8305-8316
[27]   Effective Cerebral Connectivity during Silent Speech Reading Revealed by Functional Magnetic Resonance Imaginge [J].
Chu, Ying-Hua ;
Lin, Fa-Hsuan ;
Chou, Yu-Jen ;
Tsai, Kevin W. -K. ;
Kuo, Wen-Jui ;
Jaaskelainen, Iiro P. .
PLOS ONE, 2013, 8 (11)
[28]   Effects of Prior Information on Decoding Degraded Speech: An fMRI Study [J].
Clos, Mareike ;
Langner, Robert ;
Meyer, Martin ;
Oechslin, Mathias S. ;
Zilles, Karl ;
Eickhoff, Simon B. .
HUMAN BRAIN MAPPING, 2014, 35 (01) :61-74
[29]   Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions [J].
Crosse, Michael J. ;
Butler, John S. ;
Lalor, Edmund C. .
JOURNAL OF NEUROSCIENCE, 2015, 35 (42) :14195-14204
[30]   Adaptive Temporal Encoding Leads to a Background-Insensitive Cortical Representation of Speech [J].
Ding, Nai ;
Simon, Jonathan Z. .
JOURNAL OF NEUROSCIENCE, 2013, 33 (13) :5728-5735