152 entries in total
- [1] Sutton RS, Barto AG., Reinforcement Learning: An Introduction, (2018)
- [2] Goodfellow I, Bengio Y, Courville A., Deep Learning, (2016)
- [3] Liu Q, Zhai JW, Zhang ZZ, Zhong S, Zhou Q, Zhang P, Xu J., A survey on deep reinforcement learning, Chinese Journal of Computers, 41, 1, pp. 1-27, (2018)
- [4] Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G., Human-level control through deep reinforcement learning, Nature, 518, 7540, pp. 529-533, (2015)
- [5] Lillicrap TP, Hunt JJ, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D., Continuous control with deep reinforcement learning, Proc. of the Int’l Conf. on Learning Representations, (2016)
- [6] Babaeizadeh M, Frosio I, Tyree S, Clemons J, Kautz J., Reinforcement learning through asynchronous advantage actor-critic on a GPU, Proc. of the Int’l Conf. on Learning Representations, (2017)
- [7] Lai J, Wei JY, Chen XL., Overview of hierarchical reinforcement learning, Computer Engineering and Applications, 57, 3, pp. 72-79, (2021)
- [8] Sutton RS, Precup D, Singh S., Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning, Artificial Intelligence, 112, 1-2, pp. 181-211, (1999)
- [9] Kulkarni TD, Narasimhan K, Saeedi A, Tenenbaum J., Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation, Advances in Neural Information Processing Systems, pp. 3675-3683, (2016)
- [10] Tang H, Hao J, Lv T, Chen Y, Zhang Z, Jia H, Ren C, Zheng Y, Meng Z, Fan C., Hierarchical deep multiagent reinforcement learning with temporal abstraction, (2018)