50 entries in total
[31] A Q-Learning Approach to Derive Optimal Consumption and Investment Strategies [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (08): 1234-1243
[32] Entropy-Based Prioritized Sampling in Deep Q-Learning [J]. 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017: 1068-1072
[33] The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[36] Risk Aversion Operator for Addressing Maximization Bias in Q-Learning [J]. IEEE ACCESS, 2020, 8: 43098-43110
[38] Error bounds for constant step-size Q-learning [J]. SYSTEMS & CONTROL LETTERS, 2012, 61 (12): 1203-1208