Toward an Integration of Deep Learning and Neuroscience

被引:368
作者
Marblestone, Adam H. [1 ]
Wayne, Greg [2 ]
Kording, Konrad P. [3 ]
机构
[1] MIT, Media Lab, Synthet Neurobiol Grp, Cambridge, MA 02139 USA
[2] Google Deepmind, London, England
[3] Northwestern Univ, Rehabil Inst Chicago, Chicago, IL 60611 USA
关键词
cost functions; neural networks; neuroscience; cognitive architecture; TIMING-DEPENDENT PLASTICITY; ORGANIZING NEURAL-NETWORK; LONG-TERM POTENTIATION; PREFRONTAL CORTEX; RECEPTIVE-FIELD; WORKING-MEMORY; BASAL GANGLIA; BAYESIAN-INFERENCE; CONCEPTUAL KNOWLEDGE; COMPUTATIONAL MODEL;
D O I
10.3389/fncom.2016.00094
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Neuroscience has focused on the detailed implementation of computation, studying neural codes, dynamics and circuits. In machine learning, however, artificial neural networks tend to eschew precisely designed codes, dynamics or circuits in favor of brute force optimization of a cost function, often using simple and relatively uniform initial architectures. Two recent developments have emerged within machine learning that create an opportunity to connect these seemingly divergent perspectives. First, structured architectures are used, including dedicated systems for attention, recursion and various forms of short- and long-term memory storage. Second, cost functions and training procedures have become more complex and are varied across layers and over time. Here we think about the brain in terms of these ideas. We hypothesize that (1) the brain optimizes cost functions, (2) these cost functions are diverse and differ across brain locations and over development, and (3) optimization operates within a pre-structured architecture matched to the computational problems posed by behavior. Such a heterogeneously optimized system, enabled by a series of interacting cost functions, serves to make learning data-efficient and precisely targeted to the needs of the organism. We suggest directions by which neuroscience could seek to refine and test these hypotheses.
引用
收藏
页数:41
相关论文
共 489 条
  • [1] Functional significance of long-term potentiation for sequence learning and prediction
    Abbott, LF
    Blum, KI
    [J]. CEREBRAL CORTEX, 1996, 6 (03) : 406 - 416
  • [2] Multifaceted aspects of chunking enable robust algorithms
    Acuna, Daniel E.
    Wymbs, Nicholas F.
    Reynolds, Chelsea A.
    Picard, Nathalie
    Turner, Robert S.
    Strick, Peter L.
    Grafton, Scott T.
    Kording, Konrad P.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2014, 112 (08) : 1849 - 1856
  • [3] Abstract Structural Representations of Goal-Directed Behavior
    Allen, Kachina
    Ibara, Steven
    Seymour, Amy
    Cordova, Natalia
    Botvinick, Matthew
    [J]. PSYCHOLOGICAL SCIENCE, 2010, 21 (10) : 1518 - 1524
  • [4] SHIFTER CIRCUITS - A COMPUTATIONAL STRATEGY FOR DYNAMIC ASPECTS OF VISUAL PROCESSING
    ANDERSON, CH
    VANESSEN, DC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (17) : 6297 - 6301
  • [5] Coding of saliency by ensemble bursting in the amygdala of primates
    Andino, S. L. Gonzalez
    Menendez, R. Grave de Peralta
    [J]. FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2012, 6
  • [6] Angelucci A, 2002, J NEUROSCI, V22, P8633
  • [7] [Anonymous], 2015, ARXIV14117783
  • [8] [Anonymous], NEOCORTEX
  • [9] [Anonymous], 2016, ARXIV160305106
  • [10] [Anonymous], INT C MACH LEARN ED