Reconstructing Speech from Human Auditory Cortex

被引:372
作者
Pasley, Brian N. [1 ]
David, Stephen V. [2 ,3 ]
Mesgarani, Nima [2 ,3 ,4 ]
Flinker, Adeen [1 ]
Shamma, Shihab A. [2 ,3 ]
Crone, Nathan E. [5 ]
Knight, Robert T. [1 ,4 ,6 ]
Chang, Edward F. [4 ]
机构
[1] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[4] Univ Calif San Francisco, Dept Neurol Surg, San Francisco, CA USA
[5] Johns Hopkins Univ, Dept Neurol, Baltimore, MD 21218 USA
[6] Univ Calif Berkeley, Dept Psychol, Berkeley, CA 94720 USA
关键词
SPECTROTEMPORAL RECEPTIVE-FIELDS; NEURAL REPRESENTATION; NATURAL IMAGES; TEMPORAL INFORMATION; MODULATION; RESPONSES; THALAMUS; ENVELOPE; SYSTEM;
D O I
10.1371/journal.pbio.1001251
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
How the human auditory system extracts perceptually relevant acoustic features of speech is unknown. To address this question, we used intracranial recordings from nonprimary auditory cortex in the human superior temporal gyrus to determine what acoustic information in speech sounds can be reconstructed from population neural activity. We found that slow and intermediate temporal fluctuations, such as those corresponding to syllable rate, were accurately reconstructed using a linear model based on the auditory spectrogram. However, reconstruction of fast temporal fluctuations, such as syllable onsets and offsets, required a nonlinear sound representation based on temporal modulation energy. Reconstruction accuracy was highest within the range of spectro-temporal fluctuations that have been found to be critical for speech intelligibility. The decoded speech representations allowed readout and identification of individual words directly from brain activity during single trial sound presentations. These findings reveal neural encoding mechanisms of speech acoustic parameters in higher order human auditory cortex.
引用
收藏
页数:13
相关论文
共 56 条
[1]   SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION [J].
ADELSON, EH ;
BERGEN, JR .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) :284-299
[2]   Differential neural coding of acoustic flutter within primate auditory cortex [J].
Bendor, Daniel ;
Wang, Xiaoqin .
NATURE NEUROSCIENCE, 2007, 10 (06) :763-771
[3]   READING A NEURAL CODE [J].
BIALEK, W ;
RIEKE, F ;
VANSTEVENINCK, RRD ;
WARLAND, D .
SCIENCE, 1991, 252 (5014) :1854-1857
[4]  
Boulesteix AL, 2008, CANCER INFORM, V6, P77
[5]   Spatiotemporal dynamics of word processing in the human brain [J].
Canolty, Ryan T. ;
Soltani, Maryam ;
Dalal, Sarang S. ;
Edwards, Erik ;
Dronkers, Nina F. ;
Nagarajan, Srikantan S. ;
Kirsch, Heidi E. ;
Barbaro, Nicholas M. ;
Knight, Robert T. .
FRONTIERS IN NEUROSCIENCE, 2007, 1 (01) :185-196
[6]   Categorical speech representation in human superior temporal gyrus [J].
Chang, Edward F. ;
Rieger, Jochem W. ;
Johnson, Keith ;
Berger, Mitchel S. ;
Barbaro, Nicholas M. ;
Knight, Robert T. .
NATURE NEUROSCIENCE, 2010, 13 (11) :1428-U169
[7]   Multiresolution spectrotemporal analysis of complex sounds [J].
Chi, T ;
Ru, PW ;
Shamma, SA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02) :887-906
[8]   Spectro-temporal modulation transfer functions and speech intelligibility [J].
Chi, TS ;
Gao, YJ ;
Guyton, MC ;
Ru, PW ;
Shamma, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) :2719-2732
[9]   Induced electrocorticographic gamma activity during auditory perception [J].
Crone, NE ;
Boatman, D ;
Gordon, B ;
Hao, L .
CLINICAL NEUROPHYSIOLOGY, 2001, 112 (04) :565-582
[10]   Spatial localization of cortical time-frequency dynamics [J].
Dalal, Sarang S. ;
Guggisberg, Adrian G. ;
Edwards, Erik ;
Sekihara, Kensuke ;
Findlay, Anne M. ;
Canolty, Ryan T. ;
Knight, Robert T. ;
Barbaro, Nicholas M. ;
Kirsch, Heidi E. ;
Nagarajan, Srikantan S. .
2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, :4941-+