Tracing the emergence of categorical speech perception in the human auditory system

被引:143
作者
Bidelman, Gavin M. [1 ,2 ]
Moreno, Sylvain [3 ]
Alain, Claude [3 ,4 ]
机构
[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38105 USA
[2] Univ Memphis, Sch Commun Sci & Disorders, Memphis, TN 38105 USA
[3] Baycrest Ctr Geriatr Care, Rotman Res Inst, Toronto, ON M6A 2E1, Canada
[4] Univ Toronto, Dept Psychol, Toronto, ON M6A 2E1, Canada
基金
加拿大健康研究院; 加拿大自然科学与工程研究理事会;
关键词
Categorical perception; Speech perception; Brainstem response; Auditory event-related potentials (ERP); Neural computation; FREQUENCY-FOLLOWING RESPONSES; EVENT-RELATED POTENTIALS; BRAIN-STEM; CORTICAL REPRESENTATION; EVOKED-POTENTIALS; SELECTIVE ATTENTION; PITCH SALIENCE; INFORMATION; SOUND; DISCRIMINATION;
D O I
10.1016/j.neuroimage.2013.04.093
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Speech perception requires the effortless mapping from smooth, seemingly continuous changes in sound features into discrete perceptual units, a conversion exemplified in the phenomenon of categorical perception. Explaining how/when the human brain performs this acoustic-phonetic transformation remains an elusive problem in current models and theories of speech perception. In previous attempts to decipher the neural basis of speech perception, it is often unclear whether the alleged brain correlates reflect an underlying percept or merely changes in neural activity that covary with parameters of the stimulus. Here, we recorded neuroelectric activity generated at both cortical and subcortical levels of the auditory pathway elicited by a speech vowel continuum whose percept varied categorically from /u/ to /a/. This integrative approach allows us to characterize how various auditory structures code, transform, and ultimately render the perception of speech material as well as dissociate brain responses reflecting changes in stimulus acoustics from those that index true internalized percepts. We find that activity from the brainstem mirrors properties of the speech waveform with remarkable fidelity, reflecting progressive changes in speech acoustics but not the discrete phonetic classes reported behaviorally. In comparison, patterns of late cortical evoked activity contain information reflecting distinct perceptual categories and predict the abstract phonetic speech boundaries heard by listeners. Our findings demonstrate a critical transformation in neural speech representations between brainstem and early auditory cortex analogous to an acoustic-phonetic mapping necessary to generate categorical speech percepts. Analytic modeling demonstrates that a simple nonlinearity accounts for the transformation between early (subcortical) brain activity and subsequent cortical/behavioral responses to speech (>15-200 ms) thereby describing a plausible mechanism by which the brain achieves its acoustic-to-phonetic mapping. Results provide evidence that the neurophysiological underpinnings of categorical speech are present cortically by similar to 175 ms after sound enters the ear. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 95 条
[1]   The use of cortical auditory evoked potentials to evaluate neural encoding of speech sounds in adults [J].
Agung, Katrina ;
Purdy, Suzanne C. ;
McMahon, Catherine M. ;
Newall, Philip .
JOURNAL OF THE AMERICAN ACADEMY OF AUDIOLOGY, 2006, 17 (08) :559-572
[2]   Envelope following responses to natural vowels [J].
Aiken, Steven J. ;
Picton, Terence W. .
AUDIOLOGY AND NEURO-OTOLOGY, 2006, 11 (04) :213-232
[3]   Envelope and spectral frequency-following responses to vowel sounds [J].
Aiken, Steven J. ;
Picton, Terence W. .
HEARING RESEARCH, 2008, 245 (1-2) :35-47
[4]  
Alain C, 2000, FRONT BIOSCI, P202
[5]  
[Anonymous], 1993, An introduction to the bootstrap
[6]  
[Anonymous], 1976, Communication and Cybernetics
[7]  
[Anonymous], 1996, Phonological development: The origins of language in the child
[8]   CATEGORICAL EFFECTS IN THE PERCEPTION OF FACES [J].
BEALE, JM ;
KEIL, FC .
COGNITION, 1995, 57 (03) :217-239
[9]   Effects of reverberation on brainstem representation of speech in musicians and non-musicians [J].
Bidebnan, Gavin M. ;
Krishnan, Ananthanarayan .
BRAIN RESEARCH, 2010, 1355 :112-125
[10]   Musicians and tone-language speakers share enhanced brainstem encoding but not perceptual benefits for musical pitch [J].
Bidelman, Gavin M. ;
Gandour, Jackson T. ;
Krishnan, Ananthanarayan .
BRAIN AND COGNITION, 2011, 77 (01) :1-10