GCL-YOLO: A GhostConv-Based Lightweight YOLO Network for UAV Small Object Detection

Cited by: 46

Authors:

Cao, Jinshan [1]

Bao, Wenshu [1]

Shang, Haixing [2]

Yuan, Ming [1]

Cheng, Qian [1]

Affiliations:

[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China

[2] Northwest Engn Corp Ltd, Power China Grp, Xian 710064, Peoples R China

Source:

REMOTE SENSING | 2023 / Volume 15 / Issue 20

Keywords:

unmanned aerial vehicle (UAV); small object detection; lightweight network; efficient network; YOLO; GhostConv;

DOI:

10.3390/rs15204932

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Precise object detection in unmanned aerial vehicle (UAV) images is a prerequisite for many UAV image applications. Compared with natural scene images, UAV images often contain many small objects that occupy only a few pixels. These small objects are often occluded, densely distributed, or set in complex scenes, which greatly interferes with object detection. To address this problem, a GhostConv-based lightweight YOLO network (GCL-YOLO) is proposed. In the proposed network, a GhostConv-based backbone network with few parameters was first built. Then, a new prediction head for small UAV objects was designed, and the original prediction head for large natural scene objects was removed. Finally, the focal-efficient intersection over union (Focal-EIOU) loss was used as the localization loss. Experimental results on the VisDrone-DET2021 dataset and the UAVDT dataset showed that, compared with the YOLOv5-S network, the mean average precision at IOU = 0.5 achieved by the proposed GCL-YOLO-S network improved by 6.9% and 1.8%, respectively, while the number of parameters and the amount of computation were reduced by 76.7% and 32.3%, respectively. Compared with other strong lightweight networks, the proposed network achieved the highest and second-highest detection accuracy on the two datasets, respectively, with the smallest number of parameters and a moderate amount of computation.
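The abstract names the Focal-EIOU loss as the localization loss but does not give its formula. The sketch below follows the commonly cited definition (an EIoU term made up of 1 - IoU plus normalized center-distance, width, and height penalties, reweighted by IoU raised to a power gamma). The function name, the (x1, y1, x2, y2) box layout, the default gamma = 0.5, and the eps guard are all assumptions made for illustration; the GCL-YOLO implementation may differ.

```python
def focal_eiou_loss(pred, target, gamma=0.5, eps=1e-9):
    """Illustrative Focal-EIoU loss for two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) tuples. gamma and eps are assumed
    defaults, not values taken from the GCL-YOLO record.
    """
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target

    # Widths and heights of the predicted and target boxes.
    pw, ph = px2 - px1, py2 - py1
    tw, th = tx2 - tx1, ty2 - ty1

    # Intersection over union.
    iw = max(0.0, min(px2, tx2) - max(px1, tx1))
    ih = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = iw * ih
    union = pw * ph + tw * th - inter
    iou = inter / (union + eps)

    # Width and height of the smallest box enclosing both boxes.
    cw = max(px2, tx2) - min(px1, tx1)
    ch = max(py2, ty2) - min(py1, ty1)

    # Squared center distance, normalized by the enclosing-box diagonal.
    dx = (px1 + px2) / 2 - (tx1 + tx2) / 2
    dy = (py1 + py2) / 2 - (ty1 + ty2) / 2
    center_term = (dx * dx + dy * dy) / (cw * cw + ch * ch + eps)

    # Width and height penalties, normalized by the enclosing-box sides.
    width_term = (pw - tw) ** 2 / (cw * cw + eps)
    height_term = (ph - th) ** 2 / (ch * ch + eps)

    eiou = 1.0 - iou + center_term + width_term + height_term
    # Focal reweighting: scale the EIoU loss by IoU ** gamma.
    return (iou ** gamma) * eiou


if __name__ == "__main__":
    print(focal_eiou_loss((0, 0, 2, 2), (1, 1, 3, 3)))
```

In this scalar form, identical boxes give a loss close to zero, and boxes with no overlap are also weighted to zero because IoU is zero; training code typically operates on batched tensors and handles such cases through its sample-assignment logic.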

References

Pages: 24

37 entries in total

[1] Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].

Audebert, Nicolas ;

Le Saux, Bertrand ;

Lefevre, Sebastien .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32

[2] Bochkovskiy A, 2020, arXiv:2004.10934, DOI 10.48550/arXiv.2004.10934

[3] A full data augmentation pipeline for small object detection based on generative adversarial networks [J].

Bosquet, Brais ;

Cores, Daniel ;

Seidenari, Lorenzo ;

Brea, Victor M. ;

Mucientes, Manuel ;

Del Bimbo, Alberto .

PATTERN RECOGNITION, 2023, 133

[4] VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results [J].

Cao, Yaru ;

He, Zhijian ;

Wang, Lujia ;

Wang, Wenguan ;

Yuan, Yixuan ;

Zhang, Dingwen ;

Zhang, Jinglin ;

Zhu, Pengfei ;

Van Gool, Luc ;

Han, Junwei ;

Hoi, Steven ;

Hu, Qinghua ;

Liu, Ming ;

Cheng, Chong ;

Liu, Fanfan ;

Cao, Guojin ;

Li, Guozhen ;

Wang, Hongkai ;

He, Jianye ;

Wan, Junfeng ;

Wan, Qi ;

Zhao, Qi ;

Lyu, Shuchang ;

Zhao, Wenzhe ;

Lu, Xiaoqiang ;

Zhu, Xingkui ;

Liu, Yingjie ;

Lv, Yixuan ;

Ma, Yujing ;

Yang, Yuting ;

Wang, Zhe ;

Xu, Zhenyu ;

Luo, Zhipeng ;

Zhang, Zhimin ;

Zhang, Zhiguang ;

Li, Zihao ;

Zhang, Zixiao .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :2847-2854

[5] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[6] TOOD: Task-aligned One-stage Object Detection [J].

Feng, Chengjian ;

Zhong, Yujie ;

Gao, Yu ;

Scott, Matthew R. ;

Huang, Weilin .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3490-3499

[7] Howard AG, 2017, arXiv:1704.04861

[8] Ge Z, 2021, arXiv:2107.08430, DOI 10.48550/arXiv.2107.08430

[9] GhostNet: More Features from Cheap Operations [J].

Han, Kai ;

Wang, Yunhe ;

Tian, Qi ;

Guo, Jianyuan ;

Xu, Chunjing ;

Xu, Chang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586

[10] Searching for MobileNetV3 [J].

Howard, Andrew ;

Sandler, Mark ;

Chu, Grace ;

Chen, Liang-Chieh ;

Chen, Bo ;

Tan, Mingxing ;

Wang, Weijun ;

Zhu, Yukun ;

Pang, Ruoming ;

Vasudevan, Vijay ;

Le, Quoc V. ;

Adam, Hartwig .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324
