Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs

被引:0
|
作者
Straub, Dominik [1 ]
Schultheis, Matthias [1 ]
Koeppl, Heinz [1 ]
Rothkopf, Constantin A. [1 ]
机构
[1] Tech Univ Darmstadt, Ctr Cognit Sci, Darmstadt, Germany
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
OPTIMAL FEEDBACK-CONTROL; NOISE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inverse optimal control can be used to characterize behavior in sequential decision-making tasks. Most existing work, however, is limited to fully observable or linear systems, or requires the action signals to be known. Here, we introduce a probabilistic approach to inverse optimal control for partially observable stochastic non-linear systems with unobserved action signals, which unifies previous approaches to inverse optimal control with maximum causal entropy formulations. Using an explicit model of the noise characteristics of the sensory and motor systems of the agent in conjunction with local linearization techniques, we derive an approximate likelihood function for the model parameters, which can be computed within a single forward pass. We present quantitative evaluations on stochastic and partially observable versions of two classic control tasks and two human behavioral tasks. Importantly, we show that our method can disentangle perceptual factors and behavioral costs despite the fact that epistemic and pragmatic actions are intertwined in sequential decision-making under uncertainty, such as in active sensing and active learning. The proposed method has broad applicability, ranging from imitation learning to sensorimotor neuroscience.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] OPTIMAL-CONTROL IN ADAPTIVE OPTICS MODELING OF NON-LINEAR SYSTEMS
    HERRMANN, J
    PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1983, 365 : 84 - 88
  • [32] APPROXIMATION SCHEMES FOR NON-LINEAR NEUTRAL OPTIMAL-CONTROL SYSTEMS
    KUNISCH, K
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1981, 82 (01) : 112 - 143
  • [33] Optimal discrete-time control for non-linear cascade systems
    Haddad, WM
    Chellaboina, VS
    Fausz, JL
    Abdallah, C
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1998, 335B (05): : 827 - 839
  • [34] SUB-OPTIMAL CONTROL OF NON-LINEAR DYNAMICAL SYSTEMS VIA LINEAR APPROXIMATION
    KRIECHBAUM, GK
    NOGES, E
    INTERNATIONAL JOURNAL OF CONTROL, 1971, 13 (06) : 1183 - +
  • [35] Design of robust control Lyapunov function for non-linear affine systems with uncertainty
    Yang, Y.
    Lee, J. M.
    IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (14): : 2248 - 2256
  • [36] Adaptive inverse control of non-linear systems with unknown complex hysteretic non-linearities
    Li, Z.
    Hu, Y.
    Liu, Y.
    Chen, T.
    Yuan, P.
    IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (01): : 1 - 7
  • [37] Optimal control using an algebraic method for control-affine non-linear systems
    Won, C.-H.
    Biswas, S.
    INTERNATIONAL JOURNAL OF CONTROL, 2007, 80 (09) : 1491 - 1502
  • [38] H∞ control for conic non-linear jump systems with partially unknown transition probabilities
    Zhang, Yingqi
    Shi, Yan
    Mu, Xiaowu
    Liu, Caixia
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2017, 48 (14) : 2976 - 2984
  • [39] Robust near-optimal output feedback control of non-linear systems
    El-Farra, NH
    Christofides, PD
    INTERNATIONAL JOURNAL OF CONTROL, 2001, 74 (02) : 133 - 157
  • [40] Optimal control of a class of non-linear discrete-continuous hybrid systems
    Hu, Minghui
    Gao, Qian
    Shao, Huihe
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 835 - 838