共 42 条
- [1] Achiam J, 2017, PR MACH LEARN RES, V70
- [2] [Anonymous], 2018, OPENAI FIVE
- [3] [Anonymous], 2010, THESIS CARNEGIE MELL
- [4] BAIRD LC, 1994, 1994 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOL 1-7, P2448, DOI 10.1109/ICNN.1994.374604
- [5] THE THEORY OF DYNAMIC PROGRAMMING [J]. BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1954, 60 (06) : 503 - 515
- [6] Bitam S., 2012, GLOBECOM 2012 - 2012 IEEE Global Communications Conference, P2054, DOI 10.1109/GLOCOM.2012.6503418
- [7] Caruana R., 1993, P 10 INT C MACH LEAR, DOI [DOI 10.1016/B978-1-55860-307-3.50012-5, 10.1016/b978-1-55860-307-3.50012-5]
- [8] Casas Noe, 2017, arXiv:1703.09035
- [9] Chen CC, 2020, AAAI CONF ARTIF INTE, V34, P3414
- [10] Convergence of V2X communication systems and next generation networks [J]. INTERNATIONAL CONFERENCE ON APPLIED SCIENCES, 2019, 477