共 138 条
[1]
Abdulhai B., Kattan L., Reinforcement learning: introduction to theory and potential for transport applications, Can J Civ Eng, 30, 6, pp. 981-991, (2003)
[2]
Abdulhai B., Pringle R., Karakoulas G.J., Reinforcement learning for True adaptive traffic signal control, J Transport Eng, 129, 3, pp. 278-285, (2003)
[3]
Achiam J., Sastry S., Surprise-based intrinsic motivation for deep reinforcement, learning, (2017)
[4]
Allgower E.L., Georg K., Numerical continuation methods, Springer series in computational mathematics, 13, (1990)
[5]
Andrychowicz M., Wolski F., Ray A., Schneider J., Fong R., Welinder P., McGrew B., Tobin J., Abbeel P., Zaremba W., Hindsight experience replay, Tech. rep, (2017)
[6]
Barto A.G., Intrinsic motivation and reinforcement learning, Intrinsically motivated learning in natural and artificial systems, vol 9783642323751, pp. 17-47, (2013)
[7]
Barto A.G., Mahadevan S., Recent advances in hierarchical reinforcement learning, (2003)
[8]
Bellemare M.G., Srinivasan S., Ostrovski G., Schaul T., Saxton D., Deepmind G., Munos R., Unifying count-based exploration and intrinsic motivation, Tech. rep, (2016)
[9]
Bengio Y., Louradour J., Collobert R., Weston J., Curriculum learning, ACM international conference proceeding series, 382, pp. 1-8, (2009)
[10]
Berseth G., Xie C., Cernek P., van De Panne M., Progressive reinforcement learning with distillation for multi-skilled motion control, 6Th International Conference on Learning Representations, ICLR 2018—conference Track Proceedings., 1802, (2018)