Machine Learning Capabilities of a Simulated Cerebellum

Cited by: 26

Authors:

Hausknecht, Matthew [1]

Li, Wen-Ke [2]

Mauk, Michael [2]

Stone, Peter [1]

Affiliations:

[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

[2] Univ Texas Austin, Ctr Learning & Memory, Austin, TX 78712 USA

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2017 / Vol. 28 / Issue 03

Funding:

National Science Foundation (USA);

Keywords:

Cerebellar pattern recognition; cerebellum; inverted pendulum balancing (cart-pole); MNIST handwritten digit recognition; proportional-integral-derivative (PID) control; robot balance; PREDICTIVE MOTOR CONTROL; NETWORK MODEL; COORDINATION; CORTEX;

DOI:

10.1109/TNNLS.2015.2512838

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes the learning and control capabilities of a biologically constrained bottom-up model of the mammalian cerebellum. Results are presented from six tasks: 1) eyelid conditioning; 2) pendulum balancing; 3) proportional-integral-derivative control; 4) robot balancing; 5) pattern recognition; and 6) MNIST handwritten digit recognition. These tasks span several paradigms of machine learning, including supervised learning, reinforcement learning, control, and pattern recognition. Results over these six domains indicate that the cerebellar simulation is capable of robustly identifying static input patterns even when randomized across the sensory apparatus. This capability allows the simulated cerebellum to perform several different supervised learning and control tasks. On the other hand, both reinforcement learning and temporal pattern recognition prove problematic due to the delayed nature of error signals and the simulator's inability to solve the credit assignment problem. These results are consistent with previous findings which hypothesize that in the human brain, the basal ganglia is responsible for reinforcement learning, while the cerebellum handles supervised learning.


Pages: 510-522

Page count: 13
