共 46 条
- [1] Aswani A(2013)Provably safe and robust learning-based model predictive control Automatica 49 1216-1226
- [2] Gonzalez H(1997)Nonlinear programming Journal of the Operational Research Society 48 334-6120
- [3] Sastry SS(2017)Risk-constrained reinforcement learning with percentile risk criteria The Journal of Machine Learning Research 18 6070-121
- [4] Tomlin C(2001)Nonlinear lagrangian theory for nonconvex optimization Journal of Optimization Theory and Applications 109 99-284
- [5] Bertsekas DP(2008)Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies Journal of Machine Learning Research 9 235-224
- [6] Chow Y(2005)Robust model predictive control of constrained linear systems with bounded disturbances Automatica 41 219-533
- [7] Ghavamzadeh M(2015)Human-level control through deep reinforcement learning Nature 518 529-42
- [8] Janson L(2000)Optimization of conditional value-at-risk Journal of Risk 2 21-55
- [9] Pavone M(2001)A mathematical theory of communication ACM SIGMOBILE Mobile Computing and Communications Review 5 3-489
- [10] Goh C(2016)Mastering the game of go with deep neural networks and tree search Nature 529 484-359