共 60 条
- [1] Agarwal Alekh, 2021, JOURNAL OF MACHINE LEARNING RESEARCH, V22
- [3] Bertsekas DP, 1996, NEURO DYNAMIC PROGRA
- [4] Boutilier C, 1996, THEORETICAL ASPECTS OF RATIONALITY AND KNOWLEDGE, P195
- [5] Buehler M, 2009, SPRINGER TRAC ADV RO, V56, P1, DOI 10.1007/978-3-642-03991-1
- [8] Engstrom L, 2019, PR MACH LEARN RES, V97
- [9] Fujimoto S, 2018, PR MACH LEARN RES, V80
- [10] Gupta Jayesh K., 2017, Autonomous Agents and Multiagent Systems, AAMAS 2017: Workshops, Best Papers. Revised Selected Papers: LNAI 10642, P66, DOI 10.1007/978-3-319-71682-4_5