共 50 条
- [41] Asymptotic properties of two time-scale stochastic approximation algorithms with constant step sizes PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 4426 - 4431
- [43] Neural Temporal-Difference Learning Converges to Global Optima ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [45] Temporal-Difference Q-learning in Active Fault Diagnosis 2016 3RD CONFERENCE ON CONTROL AND FAULT-TOLERANT SYSTEMS (SYSTOL), 2016, : 287 - 292
- [46] Temporal-Difference Learning An Online Support Vector Regression Approach ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 318 - 323
- [49] Implementing Temporal-Difference Learning with the Scaled Conjugate Gradient Algorithm Neural Processing Letters, 2005, 22 : 361 - 375