Perceptual learning of degraded speech by minimizing prediction error

被引:75
作者
Sohoglu, Ediz [1 ,2 ]
Davis, Matthew H. [1 ]
机构
[1] MRC, Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
[2] UCL, Ear Inst, London WC1X 8EE, England
基金
英国医学研究理事会;
关键词
perceptual learning; predictive coding; speech perception; magnetoencephalography; vocoded speech; NOISE-VOCODED SPEECH; COCHLEAR IMPLANTS; AUDITORY-CORTEX; SOURCE RECONSTRUCTION; PERCEIVED CLARITY; RAPID ADAPTATION; DISTORTED SPEECH; SPOKEN LANGUAGE; FREE-ENERGY; BRAIN;
D O I
10.1073/pnas.1523266113
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human perception is shaped by past experience on multiple time-scales. Sudden and dramatic changes in perception occur when prior knowledge or expectations match stimulus content. These immediate effects contrast with the longer-term, more gradual improvements that are characteristic of perceptual learning. Despite extensive investigation of these two experience-dependent phenomena, there is considerable debate about whether they result from common or dissociable neural mechanisms. Here we test single-and dual-mechanism accounts of experience-dependent changes in perception using concurrent magnetoencephalographic and EEG recordings of neural responses evoked by degraded speech. When speech clarity was enhanced by prior knowledge obtained from matching text, we observed reduced neural activity in a peri-auditory region of the superior temporal gyrus (STG). Critically, longer-term improvements in the accuracy of speech recognition following perceptual learning resulted in reduced activity in a nearly identical STG region. Moreover, short-term neural changes caused by prior knowledge and longer-term neural changes arising from perceptual learning were correlated across subjects with the magnitude of learning-induced changes in recognition accuracy. These experience-dependent effects on neural processing could be dissociated from the neural effect of hearing physically clearer speech, which similarly enhanced perception but increased rather than decreased STG responses. Hence, the observed neural effects of prior knowledge and perceptual learning cannot be attributed to epiphenomenal changes in listening effort that accompany enhanced perception. Instead, our results support a predictive coding account of speech perception; computational simulations show how a single mechanism, minimization of prediction error, can drive immediate perceptual effects of prior knowledge and longer-term perceptual learning of degraded speech.
引用
收藏
页码:E1747 / E1756
页数:10
相关论文
共 85 条
[21]   Bayesian estimation of evoked and induced responses [J].
Friston, Karl J. ;
Henson, Richard ;
Phillips, Christophe ;
Mattout, Jeremie .
HUMAN BRAIN MAPPING, 2006, 27 (09) :722-735
[22]   Variational free energy and the Laplace approximation [J].
Friston, Karl J. ;
Mattout, Jeremie ;
Trujillo-Barreto, Nelson ;
Ashburner, John ;
Penny, Will .
NEUROIMAGE, 2007, 34 (01) :220-234
[23]   A theory of cortical responses [J].
Friston, KJ .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1456) :815-836
[24]   Temporal Predictive Codes for Spoken Words in Auditory Cortex [J].
Gagnepain, Pierre ;
Henson, Richard N. ;
Davis, Matthew H. .
CURRENT BIOLOGY, 2012, 22 (07) :615-621
[25]   Perceptual learning [J].
Goldstone, RL .
ANNUAL REVIEW OF PSYCHOLOGY, 1998, 49 :585-612
[26]  
GROSSBERG S, 1987, COGNITIVE SCI, V11, P23, DOI 10.1111/j.1551-6708.1987.tb00862.x
[27]  
Hansen P., 2010, MEG: An Introduction to Methods
[28]   Nonstationary cluster-size inference with random field and permutation methods [J].
Hayasaka, S ;
Phan, KL ;
Liberzon, I ;
Worsley, KJ ;
Nichols, TE .
NEUROIMAGE, 2004, 22 (02) :676-687
[29]   A parametric empirical Bayesian framework for the EEG/MEG inverse problem: generative models for multi-subject and multi-modal integration [J].
Henson, Richard N. ;
Wakeman, Daniel G. ;
Litvak, Vladimir ;
Friston, Karl J. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2011, 5
[30]   Predictive, Interactive Multiple Memory Systems [J].
Henson, Richard N. ;
Gagnepain, Pierre .
HIPPOCAMPUS, 2010, 20 (11) :1315-1326