54 entries in total
[1]
Bertsekas D. P., 1996, Neuro-Dynamic Programming
[2]
Bhandari J, 2018, Arxiv, DOI arXiv:1806.02450
[3]
Borkar V, 2024, Arxiv, DOI arXiv:2110.14427
[4]
A comprehensive survey of multiagent reinforcement learning [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): 156-172
[5]
Cassano L, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P505, DOI 10.23919/ECC.2019.8795670
[6]
Dalal G, 2018, AAAI CONF ARTIF INTE, P6144
[7]
Ding DS, 2021, Arxiv, DOI arXiv:1908.02805
[8]
Doan TT, 2019, PR MACH LEARN RES, V97
[9]
Durmus A., 2021, PROC MACHINE LEARNIN, V134, P1
[10]
Geist M, 2014, J MACH LEARN RES, V15, P289