Simple Acoustic Features Can Explain Phoneme-Based Predictions of Cortical Responses to Speech

被引:95
作者
Daube, Christoph [1 ]
Ince, Robin A. A. [1 ]
Gross, Joachim [1 ,2 ]
机构
[1] Univ Glasgow, Inst Neurosci & Psychol, 62 Hillhead St, Glasgow G12 8QB, Lanark, Scotland
[2] Univ Munster, Inst Biomagnetism & Biosignalanal, Malmedyweg 15, D-48149 Munster, Germany
基金
英国惠康基金;
关键词
INFORMATION; DYNAMICS; COMPREHENSION; OSCILLATIONS; PRINCIPLES; FREQUENCY; ENVELOPE; MODELS; MEG;
D O I
10.1016/j.cub.2019.04.067
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
When we listen to speech, we have to make sense of a waveform of sound pressure. Hierarchical models of speech perception assume that, to extract semantic meaning, the signal is transformed into unknown, intermediate neuronal representations. Traditionally, studies of such intermediate representations are guided by linguistically defined concepts, such as phonemes. Here, we argue that in order to arrive at an unbiased understanding of the neuronal responses to speech, we should focus instead on representations obtained directly from the stimulus. We illustrate our view with a data-driven, information theoretic analysis of a dataset of 24 young, healthy humans who listened to a 1 h narrative while their magnetoencephalogram (MEG) was recorded. We find that two recent results, the improved performance of an encoding model in which annotated linguistic and acoustic features were combined and the decoding of phoneme subgroups from phoneme-locked responses, can be explained by an encoding model that is based entirely on acoustic features. These acoustic features capitalize on acoustic edges and outperform Gabor-filtered spectrograms, which can explicitly describe the spectrotemporal characteristics of individual phonemes. By replicating our results in publicly available electroencephalography (EEG) data, we conclude that models of brain responses based on linguistic features can serve as excellent benchmarks. However, we believe that in order to further our understanding of human cortical responses to speech, we should also explore low-level and parsimonious explanations for apparent high-level phenomena.
引用
收藏
页码:1924 / +
页数:23
相关论文
共 93 条
[31]   Contributions of local speech encoding and functional connectivity to audio-visual speech perception [J].
Giordano, Bruno L. ;
Ince, Robin A. A. ;
Gross, Joachim ;
Schyns, Philippe G. ;
Panzeri, Stefano ;
Kayser, Christoph .
ELIFE, 2017, 6
[32]   Cortical oscillations and speech processing: emerging computational principles and operations [J].
Giraud, Anne-Lise ;
Poeppel, David .
NATURE NEUROSCIENCE, 2012, 15 (04) :511-517
[33]   Speech Rhythms and Multiplexed Oscillatory Sensory Coding in the Human Brain [J].
Gross, Joachim ;
Hoogenboom, Nienke ;
Thut, Gregor ;
Schyns, Philippe ;
Panzeri, Stefano ;
Belin, Pascal ;
Garrod, Simon .
PLOS BIOLOGY, 2013, 11 (12)
[34]  
Hahn T., 2018, PYTHON BASED HYPERPA
[35]   Theta-band phase tracking in the two-talker problem [J].
Hambrook, Dillon A. ;
Tata, Matthew S. .
BRAIN AND LANGUAGE, 2014, 135 :52-56
[36]   The revolution will not be controlled: natural stimuli in speech neuroscience [J].
Hamilton, Liberty S. ;
Huth, Alexander G. .
LANGUAGE COGNITION AND NEUROSCIENCE, 2020, 35 (05) :573-582
[37]   A Spatial Map of Onset and Sustained Responses to Speech in the Human Superior Temporal Gyrus [J].
Hamilton, Liberty S. ;
Edwards, Erik ;
Chang, Edward F. .
CURRENT BIOLOGY, 2018, 28 (12) :1860-+
[38]   Grounding the neurobiology of language in first principles: The necessity of non-language-centric explanations for language comprehension [J].
Hasson, Uri ;
Egidi, Giovanna ;
Marelli, Marco ;
Willems, Roel M. .
COGNITION, 2018, 180 :135-157
[39]   Magnetic brain activity phase-locked to the envelope, the syllable onsets, and the fundamental frequency of a perceived speech signal [J].
Hertrich, Ingo ;
Dietrich, Susanne ;
Trouvain, Juergen ;
Moos, Anja ;
Ackermann, Hermann .
PSYCHOPHYSIOLOGY, 2012, 49 (03) :322-334
[40]   Encoding and Decoding Models in Cognitive Electrophysiology [J].
Holdgraf, Christopher R. ;
Rieger, Jochem W. ;
Micheli, Cristiano ;
Martin, Stephanie ;
Knight, Robert T. ;
Theunissen, Frederic E. .
FRONTIERS IN SYSTEMS NEUROSCIENCE, 2017, 11