共 50 条
- [31] A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [32] Distributionally Robust Imitation Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [33] Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 6565 - 6570
- [36] Making Deep Q-learning Methods Robust to Time Discretization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [37] Q-learning with continuous state spaces and finite decision set 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 346 - +
- [39] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning FRONTIERS IN NEUROROBOTICS, 2019, 13
- [40] Learning rates for Q-Learning COMPUTATIONAL LEARNING THEORY, PROCEEDINGS, 2001, 2111 : 589 - 604