GANsformer: A Detection Network for Aerial Images with High Performance Combining Convolutional Network and Transformer

Cited by: 29
Authors
Zhang, Yan [1]
Liu, Xi [2]
Wa, Shiyun [1]
Chen, Shuyu [3]
Ma, Qin [1]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Coll Humanities & Dev, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Engn, Beijing 100083, Peoples R China
Keywords
object detection; transformer; deep learning; aerial image; generative model; GANsformer detection network; DEEP LEARNING-METHODS; OBJECT DETECTION; RECOGNITION;
DOI
10.3390/rs14040923
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Small object detection in aerial images has progressed substantially in recent years, owing to the wide application and improved performance of convolutional neural networks (CNNs). Traditional machine learning algorithms typically prioritize inference speed over accuracy. Insufficient samples can cause problems for CNNs, such as instability, non-convergence, and overfitting. Additionally, detection in aerial images faces inherent challenges, such as varying altitudes and illumination conditions and blurred, dense objects, which result in low detection accuracy. To address these issues, this paper adds a transformer backbone attention mechanism as a branch network to exploit region-wide feature information. This paper also employs a generative model to expand the input aerial images ahead of the backbone, combining the respective advantages of the generative model and the transformer network. On the dataset presented in this study, the model achieves 96.77% precision, 98.83% recall, and 97.91% mAP by adding the Multi-GANs module to the one-stage detection network. These three indices are enhanced by 13.9%, 20.54%, and 10.27%, respectively, when compared to the other detection networks. Furthermore, this study provides an auto-pruning technique that may achieve an inference speed of 32.2 FPS with a minor performance loss, to meet the requirements of real-time detection tasks. This research also develops a macOS application for the proposed algorithm using Swift.
Pages: 29
Related papers
50 in total
  • [31] Human detection in aerial thermal imaging using a fully convolutional regression network
    Haider, Ali
    Shaukat, Furqan
    Mir, Junaid
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2021, 116
  • [32] MRDet: A Multihead Network for Accurate Rotated Object Detection in Aerial Images
    Qin, Ran
    Liu, Qingjie
    Gao, Guangshuai
    Huang, Di
    Wang, Yunhong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Task-Specific Heterogeneous Network for Object Detection in Aerial Images
    Yu, Ying
    Yang, Xi
    Li, Jie
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [34] A deep neural network for vehicle detection in aerial images
    Du R.
    Cheng Y.
    [J]. Journal of Intelligent and Fuzzy Systems, 2024, 1 (01)
  • [35] Using Synthetic Images to Evaluate and Improve Object Detection Neural Network Performance on Aerial Image Datasets
    Konen, Kai
    Hecking, Tobias
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2022, 16 (03) : 339 - 356
  • [36] Disaster Detection from Aerial Imagery with Convolutional Neural Network
    Amit, Siti Nor Khuzaimah Binti
    Aoki, Yoshimitsu
    [J]. 2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), 2017 : 239 - 245
  • [37] CTF-Net: A Convolutional and Transformer Fusion Network for SAR Ship Detection
    Wu, Haoyu
    Yu, Lei
    Li, Xiangwen
    Zhou, Lin
    Zhang, Wenjing
    Bai, Guiming
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] A Lightweight Convolutional Neural Network for Ship Target Detection in SAR Images
    Hao, Yisheng
    Zhang, Ying
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 1882 - 1898
  • [39] Laryngeal Tumor Detection in Endoscopic Images Based on Convolutional Neural Network
    Cen, Qian
    Pan, Zhanpeng
    Li, Yang
    Ding, Huijun
    [J]. PROCEEDINGS OF 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY (ICEICT 2019), 2019 : 604 - 608
  • [40] Object Detection Method for Aerial Inspection Image Based on Region-based Fully Convolutional Network
    Liu S.
    Wang B.
    Gao K.
    Wang Y.
    Gao C.
    Chen J.
    [J]. Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2019, 43 (13) : 162 - 168