[11]
WANG J X, KURTH-NELSON Z, TIRUMALA D, Learning to reinforcement learn[EB/OL], (2016)
[12]
XU J Y, YAO L, LI L, Argumentation based reinforcement learning for meta-knowledge extraction[J], Information Sciences, 506, pp. 258-272, (2020)
[13]
ZHANG Y Z, YAO K J, Pursuit missions for UAV swarms based on DDPG algorithm, Acta Aeronautica et Astronautica Sinica, 41, 10, (2020)
[14]
LU J Y, LIU Q, Meta-reinforcement learning algorithm based on automating policy entropy[J], Computer Science, 48, 6, pp. 168-174, (2021)
[15]
HU Y, CHEN M Z, SAAD W, Distributed multi-agent meta learning for trajectory design in wireless drone networks[J], IEEE Journal on Selected Areas in Communications, 39, 10, pp. 3177-3192, (2021)
[16]
BELKHALE S, LI R, KAHN G, Model-based meta-reinforcement learning for flight with suspended payloads, IEEE Robotics and Automation Letters, 6, 2, pp. 1471-1478, (2021)
[17]
FUJIMOTO S, VAN HOOF H, MEGER D, Addressing function approximation error in actor-critic methods, (2018)