Stochastic optimal control and estimation methods adapted to the noise characteristics of the sensorimotor system

被引:276
作者
Todorov, E [1 ]
机构
[1] Univ Calif San Diego, Dept Cognit Sci, La Jolla, CA 92093 USA
关键词
D O I
10.1162/0899766053491887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optimality principles of biological movement are conceptually appealing and straightforward to formulate. Testing them empirically, however, requires the solution to stochastic optimal control and estimation problems for reasonably realistic models of the motor task and the sensorimotor periphery. Recent studies have highlighted the importance of incorporating biologically plausible noise into such models. Here we extend the linear-quadratic-gaussian framework-currently the only framework where such problems can be solved efficiently-to include control-dependent, state-dependent, and internal noise. Under this extended noise model, we derive a coordinate-descent algorithm guaranteed to converge to a feedback control law and a nonaclaptive linear estimator optimal with respect to each other. Numerical simulations indicate that convergence is exponential, local minima do not exist, and the restriction to nonadaptive linear estimators has negligible effects in the control problems of interest. The application of the algorithm is illustrated in the context of reaching movements. A Matlab implementation is available at www.cogsci.ucsd.edu/similar to todorov.
引用
收藏
页码:1084 / 1108
页数:25
相关论文
共 41 条
[1]   Dynamic optimization of human walking [J].
Anderson, FC ;
Pandy, MG .
JOURNAL OF BIOMECHANICAL ENGINEERING-TRANSACTIONS OF THE ASME, 2001, 123 (05) :381-390
[2]   Discrete-time optimal control with control-dependent noise and generalized Riccati difference equations [J].
Beghi, A ;
D'Alessandro, D .
AUTOMATICA, 1998, 34 (08) :1031-1034
[3]  
Bensoussan A., 1992, STOCHASTIC CONTROL P
[4]  
BERTSEKAS D, 1997, NEURODYNAMIC PROGRAM
[5]   2 MECHANISMS FOR LOCALIZATION - EVIDENCE FOR SEPARATION-DEPENDENT AND SEPARATION-INDEPENDENT PROCESSING OF POSITION INFORMATION [J].
BURBECK, CA ;
YAP, YL .
VISION RESEARCH, 1990, 30 (05) :739-750
[6]  
CHOW C K, 1971, Mathematical Biosciences, V10, P239, DOI 10.1016/0025-5564(71)90062-9
[7]  
Davis M. H. A., 1985, STOCHASTIC MODELLING, DOI 10.1007/978-94-009-4828-0
[8]   STATE-FEEDBACK CONTROL OF SYSTEMS WITH MULTIPLICATIVE NOISE VIA LINEAR MATRIX INEQUALITIES [J].
ELGHAOUI, L .
SYSTEMS & CONTROL LETTERS, 1995, 24 (03) :223-228
[9]   THE COORDINATION OF ARM MOVEMENTS - AN EXPERIMENTALLY CONFIRMED MATHEMATICAL-MODEL [J].
FLASH, T ;
HOGAN, N .
JOURNAL OF NEUROSCIENCE, 1985, 5 (07) :1688-1703
[10]   Signal-dependent noise determines motor planning [J].
Harris, CM ;
Wolpert, DM .
NATURE, 1998, 394 (6695) :780-784