Union Regression Object Localization Based on Deep Reinforcement Learning

Cited by: 0
Authors
Yao H.-G. [1]
Zhang W. [1]
Yang H.-Q. [1]
Yu J. [1]
Affiliations
[1] School of Computer Science and Engineering, Xi'an Technological University, Xi'an
Source
Zidonghua Xuebao/Acta Automatica Sinica | 2023 / Vol. 49 / No. 05
Keywords
deep reinforcement learning; object localization; recurrent neural network (RNN); visual attention mechanism
DOI
10.16383/j.aas.c200045
Abstract
To simulate the visual attention mechanism of the human eye and to search for and locate objects in images quickly and efficiently, this paper proposes a union regression deep reinforcement learning object localization model based on a recurrent neural network (RNN). The model fuses historical observation information with the observation at the current time step, analyzes the combined information to train an agent to locate the object quickly, and uses a regressor to fine-tune the bounding box produced by the agent. Experiments show that the proposed model locates objects accurately and rapidly within a few time steps. © 2023 Science Press. All rights reserved.
Pages: 1089-1098
Page count: 9