Research on a small target object detection method for aerial photography based on improved YOLOv7

被引:0
作者
Yang, Jiajun [1 ]
Zhang, Xuesong [1 ]
Song, Cunli [1 ]
机构
[1] Dalian Jiaotong Univ, Sch Software, 794 Huanghe Rd, Dalian 116028, Peoples R China
基金
中国国家自然科学基金;
关键词
Aerial image; Deep learning; Small object detection; YOLO; Vision transformer; INFRARED SMALL; TRANSFORMER;
D O I
10.1007/s00371-024-03615-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In aerial imagery analysis, detecting small targets is highly challenging due to their minimal pixel representation and complex backgrounds. To address this issue, this manuscript proposes a novel method for detecting small aerial targets. Firstly, the K-means + + algorithm is utilized to generate anchor boxes suitable for small targets. Secondly, the YOLOv7-BFAW model is proposed. This method incorporates a series of improvements to YOLOv7, including the integration of a BBF residual structure based on BiFormer and BottleNeck fusion into the backbone network, the design of an MPsim module based on simAM attention for the head network, and the development of a novel loss function, inner-WIoU v2, as the localization loss function, based on WIoU v2. Experiments demonstrate that YOLOv7-BFAW achieves a 4.2% mAP@.5 improvement on the DOTA v1.0 dataset and a 1.7% mAP@.5 improvement on the VisDrone2019 dataset, showcasing excellent generalization capabilities. Furthermore, it is shown that YOLOv7-BFAW exhibits superior detection performance compared to state-of-the-art algorithms.
引用
收藏
页码:3487 / 3501
页数:15
相关论文
共 44 条
  • [1] A Survey of Indoor and Outdoor UAV-Based Target Tracking Systems: Current Status, Challenges, Technologies, and Future Directions
    Alhafnawi, Mohannad
    Salameh, Haythem A. Bany A.
    Masadeh, Ala'eddin
    Al-Obiedollah, Haitham
    Ayyash, Moussa
    El-Khazali, Reyad
    Elgala, Hany
    [J]. IEEE ACCESS, 2023, 11 : 68324 - 68339
  • [2] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [3] A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal
    Chen, Guang
    Wang, Haitao
    Chen, Kai
    Li, Zhijun
    Song, Zida
    Liu, Yinlong
    Chen, Wenkai
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 936 - 953
  • [4] SCPA-Net: Self-calibrated pyramid aggregation for image dehazing
    Chen, Zhihua
    Zhou, Yu
    Li, Ran
    Li, Ping
    Sheng, Bin
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (3-4)
  • [5] Towards Large-Scale Small Object Detection: Survey and Benchmarks
    Cheng, Gong
    Yuan, Xiang
    Yao, Xiwen
    Yan, Kebing
    Zeng, Qinghua
    Xie, Xingxing
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
  • [6] Sig-NMS-Based Faster R-CNN Combining Transfer Learning for Small Target Detection in VHR Optical Remote Sensing Imagery
    Dong, Ruchan
    Xu, Dazhuan
    Zhao, Jin
    Jiao, Licheng
    An, Jungang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 8534 - 8545
  • [7] Coarse-grained Density Map Guided Object Detection in Aerial Images
    Duan, Chengzhen
    Wei, Zhiwei
    Zhang, Chi
    Qu, Siying
    Wang, Hongpeng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2789 - 2798
  • [8] Using deep learning in an embedded system for real-time target detection based on images from an unmanned aerial vehicle: vehicle detection as a case study
    Huang, Fang
    Chen, Shengyin
    Wang, Qi
    Chen, Yingjie
    Zhang, Dandan
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 910 - 936
  • [9] Dense Nested Attention Network for Infrared Small Target Detection
    Li, Boyang
    Xiao, Chao
    Wang, Longguang
    Wang, Yingqian
    Lin, Zaiping
    Li, Miao
    An, Wei
    Guo, Yulan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1745 - 1758
  • [10] Remote Sensing Object Detection Based on Strong Feature Extraction and Prescreening Network
    Li, Mengyuan
    Cao, Changqing
    Feng, Zhejun
    Xu, Xiangkai
    Wu, Zengyan
    Ye, Shubing
    Yong, Jiawei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20