共 50 条
- [41] Value-approximation-based online policy for vehicle routing problem with stochastic demand Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 241 - 254
- [42] ON CONVERGENCE RATE OF ADAPTIVE MULTISCALE VALUE FUNCTION APPROXIMATION FOR REINFORCEMENT LEARNING 2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
- [43] Convergence Rates of Online Critic Value Function Approximation in Native Spaces IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2145 - 2150
- [44] Improving Gaussian Process Value Function Approximation in Policy Gradient Algorithms ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT II, 2011, 6792 : 221 - +
- [45] The Divergence of Reinforcement Learning Algorithms with Value-Iteration and Function Approximation 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
- [47] A New Approach for Value Function Approximation Based on Automatic State Partition IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 208 - 213
- [50] Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming IFAC PAPERSONLINE, 2024, 58 (18): : 363 - 383