Reconstructing Speech from Human Auditory Cortex

被引:365
作者
Pasley, Brian N. [1 ]
David, Stephen V. [2 ,3 ]
Mesgarani, Nima [2 ,3 ,4 ]
Flinker, Adeen [1 ]
Shamma, Shihab A. [2 ,3 ]
Crone, Nathan E. [5 ]
Knight, Robert T. [1 ,4 ,6 ]
Chang, Edward F. [4 ]
机构
[1] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[4] Univ Calif San Francisco, Dept Neurol Surg, San Francisco, CA USA
[5] Johns Hopkins Univ, Dept Neurol, Baltimore, MD 21218 USA
[6] Univ Calif Berkeley, Dept Psychol, Berkeley, CA 94720 USA
关键词
SPECTROTEMPORAL RECEPTIVE-FIELDS; NEURAL REPRESENTATION; NATURAL IMAGES; TEMPORAL INFORMATION; MODULATION; RESPONSES; THALAMUS; ENVELOPE; SYSTEM;
D O I
10.1371/journal.pbio.1001251
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
How the human auditory system extracts perceptually relevant acoustic features of speech is unknown. To address this question, we used intracranial recordings from nonprimary auditory cortex in the human superior temporal gyrus to determine what acoustic information in speech sounds can be reconstructed from population neural activity. We found that slow and intermediate temporal fluctuations, such as those corresponding to syllable rate, were accurately reconstructed using a linear model based on the auditory spectrogram. However, reconstruction of fast temporal fluctuations, such as syllable onsets and offsets, required a nonlinear sound representation based on temporal modulation energy. Reconstruction accuracy was highest within the range of spectro-temporal fluctuations that have been found to be critical for speech intelligibility. The decoded speech representations allowed readout and identification of individual words directly from brain activity during single trial sound presentations. These findings reveal neural encoding mechanisms of speech acoustic parameters in higher order human auditory cortex.
引用
收藏
页数:13
相关论文
共 56 条
  • [1] SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION
    ADELSON, EH
    BERGEN, JR
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) : 284 - 299
  • [2] Differential neural coding of acoustic flutter within primate auditory cortex
    Bendor, Daniel
    Wang, Xiaoqin
    [J]. NATURE NEUROSCIENCE, 2007, 10 (06) : 763 - 771
  • [3] READING A NEURAL CODE
    BIALEK, W
    RIEKE, F
    VANSTEVENINCK, RRD
    WARLAND, D
    [J]. SCIENCE, 1991, 252 (5014) : 1854 - 1857
  • [4] Boulesteix AL, 2008, CANCER INFORM, V6, P77
  • [5] Spatiotemporal dynamics of word processing in the human brain
    Canolty, Ryan T.
    Soltani, Maryam
    Dalal, Sarang S.
    Edwards, Erik
    Dronkers, Nina F.
    Nagarajan, Srikantan S.
    Kirsch, Heidi E.
    Barbaro, Nicholas M.
    Knight, Robert T.
    [J]. FRONTIERS IN NEUROSCIENCE, 2007, 1 (01): : 185 - 196
  • [6] Categorical speech representation in human superior temporal gyrus
    Chang, Edward F.
    Rieger, Jochem W.
    Johnson, Keith
    Berger, Mitchel S.
    Barbaro, Nicholas M.
    Knight, Robert T.
    [J]. NATURE NEUROSCIENCE, 2010, 13 (11) : 1428 - U169
  • [7] Multiresolution spectrotemporal analysis of complex sounds
    Chi, T
    Ru, PW
    Shamma, SA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02) : 887 - 906
  • [8] Spectro-temporal modulation transfer functions and speech intelligibility
    Chi, TS
    Gao, YJ
    Guyton, MC
    Ru, PW
    Shamma, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) : 2719 - 2732
  • [9] Induced electrocorticographic gamma activity during auditory perception
    Crone, NE
    Boatman, D
    Gordon, B
    Hao, L
    [J]. CLINICAL NEUROPHYSIOLOGY, 2001, 112 (04) : 565 - 582
  • [10] Spatial localization of cortical time-frequency dynamics
    Dalal, Sarang S.
    Guggisberg, Adrian G.
    Edwards, Erik
    Sekihara, Kensuke
    Findlay, Anne M.
    Canolty, Ryan T.
    Knight, Robert T.
    Barbaro, Nicholas M.
    Kirsch, Heidi E.
    Nagarajan, Srikantan S.
    [J]. 2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4941 - +