- [11] Mnih V, Heess N, Graves A., Recurrent models of visual attention, Proceedings of the Advances in Neural Information Processing Systems, pp. 2204-2212, (2014)
- [12] Caicedo J C, Lazebnik S., Active object localization with deep reinforcement learning, Proceedings of the IEEE International Conference on Computer Vision, pp. 2488-2496, (2015)
- [13] Bueno M B, Giro-i-Nieto X, Marques F, Torres J., Hierarchical object detection with deep reinforcement learning, Deep Learning for Image Processing Applications, 11, pp. 1-9, (2016)
- [14] Hara K, Liu M Y, Tuzel O, Farahmand A M., Attentional network for visual object detection, (2017)
- [15] Shah S M, Borkar V S., Q-learning for Markov decision processes with a satisfiability criterion, Systems & Control Letters, 113, pp. 45-51, (2018)
- [16] Garcia F, Thomas P S., A meta-MDP approach to exploration for lifelong reinforcement learning, Proceedings of the Advances in Neural Information Processing Systems, pp. 5691-5700, (2019)
- [17] Sutton R S, Barto A G., Reinforcement Learning: An Introduction, (2018)
- [18] March J G., Exploration and exploitation in organizational learning, Organization Science, 2, 1, (1991)
- [19] Bertsekas D P., Dynamic Programming and Optimal Control, (1995)