50 entries in total
[32] Temporal-difference learning with nonlinear function approximation: lazy training and mean field regimes [J]. MATHEMATICAL AND SCIENTIFIC MACHINE LEARNING, VOL 145, 2021, 145: 37-74
[37] IMPROVING REINFORCEMENT LEARNING USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009 [J]. EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1-4, PROCEEDINGS, 2009: 1716-1722
[39] Particle swarm optimization based on temporal-difference learning for solving multi-objective optimization problems [J]. Computing, 2023, 105: 1795-1820