GANsformer: A Detection Network for Aerial Images with High Performance Combining Convolutional Network and Transformer

Cited by: 30
Authors
Zhang, Yan [1]
Liu, Xi [2]
Wa, Shiyun [1]
Chen, Shuyu [3]
Ma, Qin [1]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Coll Humanities & Dev, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Engn, Beijing 100083, Peoples R China
Keywords
object detection; transformer; deep learning; aerial image; generative model; GANsformer detection network; DEEP LEARNING-METHODS; RECOGNITION
DOI
10.3390/rs14040923
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Small object detection in aerial images has progressed substantially in recent years, owing to the widespread application and improved performance of convolutional neural networks (CNNs). Traditional machine learning algorithms typically prioritize inference speed over accuracy, while CNNs trained on insufficient samples can suffer from instability, non-convergence, and overfitting. Detection in aerial images also poses inherent challenges, such as varying altitudes and illumination conditions and blurred, densely packed objects, which result in low detection accuracy. To address these problems, this paper adds a transformer attention mechanism to the backbone as a branch network that uses region-wide feature information. It also employs a generative model to expand the input aerial images ahead of the backbone, combining the respective advantages of the generative model and the transformer network. On the dataset presented in this study, adding the Multi-GANs module to the one-stage detection network yields 96.77% precision, 98.83% recall, and 97.91% mAP. These three metrics improve by 13.9%, 20.54%, and 10.27%, respectively, compared to the other detection networks. Furthermore, this study provides an auto-pruning technique that can reach an inference speed of 32.2 FPS with a minor performance loss, to suit real-time detection settings. This research also develops a macOS application for the proposed algorithm using Swift.
Pages: 29