Deep networks for motor control functions

Cited by: 0
Authors
Berniker, Max [1, 2]
Kording, Konrad P. [2]
Affiliations
[1] Department of Mechanical and Industrial Engineering, University of Illinois at Chicago, Chicago, IL
[2] Department of Physical Medicine and Rehabilitation, Northwestern University, Chicago, IL
Funding
National Science Foundation (USA)
Keywords
Arm reaches; Deep learning; Motor control; Motor learning; Neural networks; Optimal control
DOI
10.3389/fncom.2015.00032
Abstract
The motor system generates time-varying commands to move our limbs and body. Conventional descriptions of motor control and learning rely on dynamical representations of our body’s state (forward and inverse models), and control policies that must be integrated forward to generate feedforward time-varying commands; thus these are representations across space, but not time. Here we examine a new approach that directly represents both time-varying commands and the resulting state trajectories with a function; a representation across space and time. Since the output of this function includes time, it necessarily requires more parameters than a typical dynamical model. To avoid the problems of local minima these extra parameters introduce, we exploit recent advances in machine learning to build our function using a stacked autoencoder, or deep network. With initial and target states as inputs, this deep network can be trained to output an accurate temporal profile of the optimal command and state trajectory for a point-to-point reach of a non-linear limb model, even when influenced by varying force fields. In a manner that mirrors motor babble, the network can also teach itself to learn through trial and error. Lastly, we demonstrate how this network can learn to optimize a cost objective. This functional approach to motor control is a sharp departure from the standard dynamical approach, and may offer new insights into the neural implementation of motor control. © 2015 Berniker and Kording.
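The functional representation described in the abstract can be pictured as a single forward pass that takes the initial and target states and returns the whole time profile at once, rather than integrating a dynamical model step by step. The following is a minimal sketch of that input/output structure only, not the authors' implementation: the layer sizes, the tanh nonlinearity, the state and command dimensions, the number of time steps, and the names `init_network` and `predict_trajectory` are all illustrative assumptions, and the weights are random (untrained, with no autoencoder pretraining).

```python
import numpy as np


def init_network(layer_sizes, seed=0):
    """Create random (untrained) weights and zero biases for each layer.

    Hypothetical helper for illustration; the paper's network is a
    trained stacked autoencoder, which this sketch does not reproduce.
    """
    rng = np.random.default_rng(seed)
    return [
        (rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])
    ]


def predict_trajectory(params, x0, x_target, n_steps, n_state, n_cmd):
    """Map (initial state, target state) to full state and command profiles.

    The output layer has n_steps * (n_state + n_cmd) units, so time is
    part of the output rather than something obtained by integration.
    """
    h = np.concatenate([x0, x_target])
    for W, b in params[:-1]:
        h = np.tanh(h @ W + b)  # assumed nonlinearity
    W, b = params[-1]
    out = (h @ W + b).reshape(n_steps, n_state + n_cmd)
    states = out[:, :n_state]     # one row per time step
    commands = out[:, n_state:]   # one row per time step
    return states, commands


# Assumed dimensions: a 4-D state, a 2-D command, 50 time steps.
n_state, n_cmd, n_steps = 4, 2, 50
params = init_network([2 * n_state, 64, 64, n_steps * (n_state + n_cmd)])

x0 = np.zeros(n_state)
x_target = np.array([0.3, 0.2, 0.0, 0.0])
states, commands = predict_trajectory(
    params, x0, x_target, n_steps, n_state, n_cmd
)
print(states.shape, commands.shape)
```

The wide output layer makes the abstract's point about parameter count concrete: because every time step has its own output units, this kind of function needs more parameters than a model that predicts only the next state.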
Pages: 10