A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy

被引:263
作者
Kell, Alexander J. E. [1 ,2 ]
Yamins, Daniel L. K. [3 ,4 ,5 ]
Shook, Erica N. [1 ,2 ]
Norman-Haignere, Sam V. [1 ]
McDermott, Josh H. [1 ,2 ,6 ]
机构
[1] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA
[2] MIT, Ctr Brains Minds & Machines, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Stanford Univ, Dept Psychol, Stanford, CA USA
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[5] Stanford Neurosci Inst, Stanford, CA USA
[6] Harvard Univ, Program Speech & Hearing Biosci & Technol, Cambridge, MA 02138 USA
关键词
CORTEX EVIDENCE; COMPLEX SOUNDS; SPEECH; ORGANIZATION; REPRESENTATION; MODULATION; SUBDIVISIONS; STATISTICS; PERCEPTION; REGIONS;
D O I
10.1016/j.neuron.2018.03.044
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
A core goal of auditory neuroscience is to build quantitative models that predict cortical responses to natural sounds. Reasoning that a complete model of auditory cortex must solve ecologically relevant tasks, we optimized hierarchical neural networks for speech and music recognition. The best-performing network contained separate music and speech pathways following early shared processing, potentially replicating human cortical organization. The network performed both tasks as well as humans and exhibited human-like errors despite not being optimized to do so, suggesting common constraints on network and human performance. The network predicted fMRI voxel responses substantially better than traditional spectrotemporal filter models throughout auditory cortex. It also provided a quantitative signature of cortical representational hierarchy-primary and non-primary responses were best predicted by intermediate and late network layers, respectively. The results suggest that task optimization provides a powerful set of tools for modeling sensory systems.
引用
收藏
页码:630 / +
页数:31
相关论文
共 77 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Music listening engages specific cortical regions within the temporal lobes: Differences between musicians and non-musicians
    Angulo-Perkins, Arafat
    Aube, William
    Peretz, Isabelle
    Barrios, Fernando A.
    Armony, Jorge L.
    Concha, Luis
    [J]. CORTEX, 2014, 59 : 126 - 137
  • [3] [Anonymous], 2001, SciPy: Open source scientific tools for Python
  • [4] [Anonymous], 2006, A guide to NumPy
  • [5] [Anonymous], 2017, BIORXIV
  • [6] [Anonymous], ARXIV171102653
  • [7] [Anonymous], 2017, ARXIV170707012
  • [8] Receptive field dimensionality increases from the auditory midbrain to cortex
    Atencio, Craig A.
    Sharpee, Tatyana O.
    Schreiner, Christoph E.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2012, 107 (10) : 2594 - 2603
  • [9] Bertin-Mahieux T., 2011, P 12 INT SOC MUS INF, V2, P10, DOI DOI 10.7916/D8NZ8J07
  • [10] Hierarchical and asymmetric temporal sensitivity in human auditory cortices
    Boemio, A
    Fromm, S
    Braun, A
    Poeppel, D
    [J]. NATURE NEUROSCIENCE, 2005, 8 (03) : 389 - 395