Machine Learning Capabilities of a Simulated Cerebellum

被引:26
作者
Hausknecht, Matthew [1 ]
Li, Wen-Ke [2 ]
Mauk, Michael [2 ]
Stone, Peter [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, Ctr Learning & Memory, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
Cerebellar pattern recognition; cerebellum; inverted pendulum balancing (cart-pole); MNIST handwritten digit recognition; proportional-integral-derivative (PID) control; robot balance; PREDICTIVE MOTOR CONTROL; NETWORK MODEL; COORDINATION; CORTEX;
D O I
10.1109/TNNLS.2015.2512838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the learning and control capabilities of a biologically constrained bottom-up model of the mammalian cerebellum. Results are presented from six tasks: 1) eyelid conditioning; 2) pendulum balancing; 3) proportional-integral-derivative control; 4) robot balancing; 5) pattern recognition; and 6) MNIST handwritten digit recognition. These tasks span several paradigms of machine learning, including supervised learning, reinforcement learning, control, and pattern recognition. Results over these six domains indicate that the cerebellar simulation is capable of robustly identifying static input patterns even when randomized across the sensory apparatus. This capability allows the simulated cerebellum to perform several different supervised learning and control tasks. On the other hand, both reinforcement learning and temporal pattern recognition prove problematic due to the delayed nature of error signals and the simulator's inability to solve the credit assignment problem. These results are consistent with previous findings which hypothesize that in the human brain, the basal ganglia is responsible for reinforcement learning, while the cerebellum handles supervised learning.
引用
收藏
页码:510 / 522
页数:13
相关论文
共 47 条
  • [11] STATISTICAL INFERENCE FOR PROBABILISTIC FUNCTIONS OF FINITE STATE MARKOV CHAINS
    BAUM, LE
    PETRIE, T
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (06): : 1554 - &
  • [12] NEURAL-NETWORK MODEL OF THE CEREBELLUM - TEMPORAL DISCRIMINATION AND THE TIMING OF MOTOR-RESPONSES
    BUONOMANO, DV
    MAUK, MD
    [J]. NEURAL COMPUTATION, 1994, 6 (01) : 38 - 55
  • [13] What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
    Doya, K
    [J]. NEURAL NETWORKS, 1999, 12 (7-8) : 961 - 974
  • [14] Eccles JC, 1967, CEREBELLUM NEURONAL
  • [15] TEXPLORE: real-time sample-efficient reinforcement learning for robots
    Hester, Todd
    Stone, Peter
    [J]. MACHINE LEARNING, 2013, 90 (03) : 385 - 429
  • [16] Hester T, 2012, IEEE INT CONF ROBOT, P85, DOI 10.1109/ICRA.2012.6225072
  • [17] The cerebellum in action:: a simulation and robotics study
    Hofstötter, C
    Mintz, M
    Verschure, PFMJ
    [J]. EUROPEAN JOURNAL OF NEUROSCIENCE, 2002, 16 (07) : 1361 - 1376
  • [18] Models of the cerebellum and motor learning
    Houk, JC
    Buckingham, JT
    Barto, AG
    [J]. BEHAVIORAL AND BRAIN SCIENCES, 1996, 19 (03) : 368 - +
  • [19] ITO M, 1989, ANNU REV NEUROSCI, V12, P85, DOI 10.1146/annurev.ne.12.030189.000505
  • [20] A Subtraction Mechanism of Temporal Coding in Cerebellar Cortex
    Kalmbach, Brian E.
    Voicu, Horatiu
    Ohyama, Tatsuya
    Mauk, Michael D.
    [J]. JOURNAL OF NEUROSCIENCE, 2011, 31 (06) : 2025 - 2034