Small Object Detection Based on Lightweight Feature Pyramid

被引:2
作者
Li, Ziyang [1 ]
Guo, Chenwei [1 ]
Han, Guang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Commun & Informat Engn, Nanjing 210003, Peoples R China
关键词
Object detection; Feature extraction; Autonomous aerial vehicles; Task analysis; Drones; Consumer electronics; Transformers; small object detection; drone aerial images; Cascade RCNN; feature pyramid;
D O I
10.1109/TCE.2024.3412168
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In today's information age, consumer drones have received more and more attention in the field of consumer electronics. Consumer drones are widely used by consumers in aerial photography, follow-up photography and other scenes. There are many researches that can be carried out in the perspective of UAV, and object detection in the perspective of UAV belongs to one of them. Driven by object detection technology, drones with intelligent perception capabilities can achieve efficient and flexible data collection capabilities. However, due to the large number, large scale variation and uneven distribution of small and medium-sized targets captured by UAV, the existing algorithms have high miss detection rate and error rate for small and medium-sized targets captured by UAV. In order to be better applied to the object detection task in the view of UAV, three modules are designed in this paper. The small object enhancement module, which consists of a feature downsampling layer and a convolution-free step layer, to better cope with the detection and classification tasks of small objects and low-resolution images. The lightweight upsampling feature pyramid structure proposed at the same time has a large receptive field, through the convolution kernel forecast block and the characteristic integration block, it makes the semantics more balanced and enhances the ability of the model to locate small objects. Finally, FocalMAE Loss which reduces the negative impact of low quality samples on the gradient and helps the model converge quickly. The proposed method achieves 8.1% improvement over the conventional method. By reducing the number of parameters and maintaining lightweight functions, it exhibits improved performance in detecting small objects in UAV images.
引用
收藏
页码:6064 / 6074
页数:11
相关论文
共 52 条
[1]  
[Anonymous], 2018, ICLR 2018
[2]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[3]   Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].
Cao, Xinkun ;
Wang, Zhipeng ;
Zhao, Yanyun ;
Su, Fei .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773
[4]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[5]   R-CNN for Small Object Detection [J].
Chen, Chenyi ;
Liu, Ming-Yu ;
Tuzel, Oncel ;
Xiao, Jianxiong .
COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 :214-230
[6]   SRECG: ECG Signal Super-Resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification [J].
Chen, Tsai-Min ;
Tsai, Yuan-Hong ;
Tseng, Huan-Hsin ;
Liu, Kai-Chun ;
Chen, Jhih-Yu ;
Huang, Chih-Han ;
Li, Guo-Yuan ;
Shen, Chun-Yen ;
Tsao, Yu .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2023, 69 (03) :250-260
[7]   Dynamic Head: Unifying Object Detection Heads with Attentions [J].
Dai, Xiyang ;
Chen, Yinpeng ;
Xiao, Bin ;
Chen, Dongdong ;
Liu, Mengchen ;
Yuan, Lu ;
Zhang, Lei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7369-7378
[8]   A Global-Local Self-Adaptive Network for Drone-View Object Detection [J].
Deng, Sutao ;
Li, Shuai ;
Xie, Ke ;
Song, Wenfeng ;
Liao, Xiao ;
Hao, Aimin ;
Qin, Hong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :1556-1569
[9]   Crowd Counting by Using Top-k Relations: A Mixed Ground-Truth CNN Framework [J].
Dong, Li ;
Zhang, Haijun ;
Yang, Kai ;
Zhou, Dongliang ;
Shi, Jianyang ;
Ma, Jianghong .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2022, 68 (03) :307-316
[10]  
Dosovitskiy A., 2021, P INT C LEARN REPR