共 50 条
[21]
A Q-Learning Approach for Adherence-Aware Recommendations
[J].
IEEE CONTROL SYSTEMS LETTERS,
2023, 7
:3645-3650
[22]
Finite-Time Theory for Momentum Q-learning
[J].
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 161,
2021, 161
:665-674
[23]
Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning
[J].
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139,
2021, 139
[24]
INTERNALLY DRIVEN Q-LEARNING Convergence and Generalization Results
[J].
ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1,
2012,
:491-494
[27]
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
[J].
2024 60TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, ALLERTON 2024,
2024,
[28]
Feature Extraction in Q-Learning using Neural Networks
[J].
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC),
2017,
[30]
Active Nearest Neighbors in Changing Environments
[J].
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37,
2015, 37
:1870-1879