Survey of Transformer-Based Object Detection Algorithms

被引:10
作者
Li, Jian [1 ]
Du, Jianqiang [1 ]
Zhu, Yanchen [1 ]
Guo, Yongkun [1 ]
机构
[1] College of Computer Science, Jiangxi University of Chinese Medicine, Nanchang
关键词
convolutional neural network(CNN); deep learning; image processing; object detection; Transformer;
D O I
10.3778/j.issn.1002-8331.2211-0133
中图分类号
学科分类号
摘要
Transformer is a kind of deep learning framework with strong modeling and parallel computing capabilities. At present, object detection algorithm based on Transformer has become a hotspot. In order to further explore new ideas and directions, this paper summarizes the existing object detection algorithm based on Transformer as well as a variety of object detection data sets and their application scenarios. This paper describes the correlation algorithms for Transformer based object detection from four aspects, i.e. feature extraction, object estimation, label matching policy and application of algorithm, compares the Transformer algorithm with the object detection algorithm based on convolutional neural network, analyzes the advantages and disadvantages of Transformer in object detection task, and proposes a general framework for Transformer based object detection model. Finally, the prospect of development trend of Transformer in the field of object detection is put forward. © 2023 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:48 / 64
页数:16
相关论文
共 97 条
[1]  
ZHANG B C., A research of target detection and recognition algorithm with region-based convolutional neural network, (2020)
[2]  
LI K Q, CHEN Y, LIU J C,, Et al., Survey of deep learning-based object detection algorithms[J], Computer Engineering, 48, 7, pp. 1-12, (2022)
[3]  
GIRSHICK R, DONAHUE J, DARRELL T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation[C], Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
[4]  
LIU Z L., Research on lane line detection based on convolutional neural network and lane keeping control algorithm, (2021)
[5]  
VASWANI A, SHAZEER N, PARMAR N, Et al., Attention is all you need[J], (2017)
[6]  
CARION N, MASSA F, SYNNAEVE G, Et al., End-to-end object detection with transformers[C], 16th European Conference on Computer Vision, pp. 213-229, (2020)
[7]  
BEAL J, KIM E, TZENG E, Et al., Toward transformer-based object detection[J], (2020)
[8]  
LIU W T, LU X M., Research progress of Transformer based on computer vision[J], Computer Engineering and Applications, 58, 6, pp. 1-16, (2022)
[9]  
CAO J L, LI Y L, SUN H Q, Et al., A survey on deep learning based visual object detection[J], Journal of Image and Graphics, 27, 6, pp. 1697-1722, (2022)
[10]  
LIU Y, ZHAN Y W., Survey of small object detection algorithms based on deep learning[J], Computer Engineering and Applications, 57, 2, pp. 37-48, (2021)