DETR Novel Small Target Detection Algorithm Based on Swin Transformer

被引:0
|
作者
Xu, Fengchang [1 ,2 ]
Alfred, Rayner [1 ]
Pailus, Rayner Henry [1 ]
Lyu, Ge [2 ]
Du, Shifeng [2 ]
Chew, Jackel Vui Lung [3 ]
Li, Guozhang [4 ]
Wang, Xinliang [5 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Creat Adv Machine Intelligence Res Ctr, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Shandong Vocat Coll Light Ind, Dept Informat Engn, Zibo 255300, Shandong, Peoples R China
[3] Univ Malaysia Sabah Labuan Int Campus, Fac Comp & Informat, Labuan 87000, Malaysia
[4] Hainan Vocat Univ Sci & Technol, Coll Informat Engn, Haikou 571126, Hainan, Peoples R China
[5] Binzhou Civil Air Def Engn & Command Support Ctr, Binzhou 256600, Shandong, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Object detection; Feature extraction; Accuracy; Adaptation models; Computational modeling; YOLO; Deep learning; Swin transformer; DETR; small target detection; deep learning;
D O I
10.1109/ACCESS.2024.3445950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A small target object refers to an object whose relative size of the bounding box is very small, usually the ratio of the width of the bounding box to the width and height of the original image is less than 0.1, or the ratio of the area of the bounding box to the area of the original image is less than 0.03, or the absolute size is less than 32(& lowast;)32 pixels. It has important applications in industrial defect detection, medical image processing, intelligent security, unmanned driving, and many other fields. Although great progress has been made in the field of target detection, which is limited to large target objects, due to the challenges of small size, inconspicuous features and insufficient data samples, the accuracy and speed of small target detection are low. To solve this problem, this paper proposes a novel small target object detection algorithm model: Swin Transformer's DETR. In this algorithm, Swin Transformer is used as the backbone to extract the global features and local information of small targets, and a three-layer feature pyramid structure is used for feature fusion at the Neck layer to improve the calculation efficiency and model accuracy. Secondly, the detector is optimized, and the detector is replaced by two stages, and the ReLU activation function of FFN layer is replaced by the latest SwiGLU activation function, to avoid the problems of gradient disappearance and explosion and enhance the nonlinearity of the algorithm model. Large resolution size input is adopted on Tiny Person dataset, and its input value is set to [1400,800]. The above analysis is carried out on VOC and Tiny Person datasets, and the detection rates of small target objects are 88.9% and 48.3% respectively. The results show that the Swin Transformer's DETR algorithm model proposed in this paper performs well on various datasets, and has strong generalization ability, stability and accuracy in different scenarios and datasets, which is higher than other algorithm models.
引用
收藏
页码:115838 / 115852
页数:15
相关论文
共 50 条
  • [1] IST-DETR: Improved DETR for Infrared Small Target Detection
    Zhao, Li
    Wang, Jianlong
    Chen, Yunhao
    Yin, Qian
    Rong, Guyao
    Zhou, Sida
    Tang, Jianing
    IEEE ACCESS, 2024, 12 : 164303 - 164314
  • [2] Infrared Small Target Detection With Swin Transformer-Based Multiscale Atrous Spatial Pyramid Pooling Network
    Wu, Heng
    Huang, Xi
    He, Chunhua
    Xiao, Huapan
    Luo, Shaojuan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [3] Underwater Target Detection Algorithm Based on YOLO and Swin Transformer for Sonar Images
    Chen, Ruoyu
    Zhan, Shuyue
    Chen, Ying
    2022 OCEANS HAMPTON ROADS, 2022,
  • [4] ISTD-DETR: A deep learning algorithm based on DETR and Super-resolution for infrared small target detection
    Yang, Huanyu
    Wang, Jun
    Bo, Yuming
    Wang, Jiacun
    NEUROCOMPUTING, 2025, 621
  • [5] STD2: Swin Transformer-Based Defect Detector for Surface Anomaly Detection
    Mia, Md Sohag
    Li, Chunbiao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [6] P-DETR: A transformer-based algorithm for pipeline structure detection
    Aromoye, Ibrahim Akinjobi
    Hiung, Lo Hai
    Sebastian, Patrick
    RESULTS IN ENGINEERING, 2025, 26
  • [7] An Embedding Swin Transformer Model for Automatic Slow-Moving Landslide Detection Based on InSAR Products
    Chen, Xuerong
    Zhao, Chaoying
    Liu, Xiaojie
    Zhang, Shuangcheng
    Xi, Jiangbo
    Khan, Basit Ali
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] FECI-RTDETR a Lightweight Unmanned Aerial Vehicle Infrared Small Target Detector Algorithm Based on RT-DETR
    Xue, Renzheng
    Hua, Shijie
    Xu, Haiqiang
    IEEE ACCESS, 2025, 13 : 9578 - 9591
  • [9] CBAM-SwinT-BL: Small Rail Surface Defect Detection Method Based on Swin Transformer With Block Level CBAM Enhancement
    Zhao, Jiayi
    Yeung, Alison Wun-Lam
    Ali, Muhammad
    Lai, Songjiang
    Ng, Vincent To-Yee
    IEEE ACCESS, 2024, 12 : 181997 - 182009
  • [10] YOLOv8-FDF: A Small Target Detection Algorithm in Complex Scenes
    Jiang, Wenlong
    Han, Dezhi
    Han, Bing
    Wu, Zhongdai
    IEEE ACCESS, 2024, 12 : 119223 - 119237