共 40 条
[1]
Amiranashvili A., 2018, INT C LEARN REPR ICL
[2]
[Anonymous], 2009, P 26 ANN INT C MACH
[3]
[Anonymous], 2010, Proceedings of the Third Conference on Artificial General Intelligence
[4]
Baird L., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P30
[5]
Brockman Greg, 2016, arXiv
[6]
Christopher John Cornish Hellaby Watkins, 1989, Learning from delayed rewards
[7]
Daley B., 2019, ADV NEURAL INFORM PR, V32, P1131
[8]
Even-Dar E, 2003, J MACH LEARN RES, V5, P1
[10]
Hallak A, 2016, AAAI CONF ARTIF INTE, P1631