SF-YOLO: A Novel YOLO Framework for Small Object Detection in Aerial Scenes

被引:0
作者
Sun, Meng [1 ,2 ]
Wang, Le [1 ,2 ,3 ]
Jiang, Wangyu [1 ,2 ]
Dharejo, Fayaz Ali [4 ,5 ]
Mao, Guojun [1 ,2 ,3 ]
Timofte, Radu [4 ,5 ]
机构
[1] Fujian Univ Technol, Coll Comp, Fujian Prov Key Lab Big Data Min & Applicat, Fuzhou, Peoples R China
[2] Fujian Univ Technol, Sch Comp Sci & Math, Fuzhou, Peoples R China
[3] Fujian Univ Technol, Technol Innovat Ctr Factored Transact Data Tourist, Minist Culture & Tourism, Fuzhou, Peoples R China
[4] Univ Wurzburg, Comp Vis Lab, CAIDAS, Wurzburg, Germany
[5] Univ Wurzburg, IFI, Wurzburg, Germany
关键词
computer vision; convolutional neural nets; convolution; feature extraction; object detection; MODEL;
D O I
10.1049/ipr2.70027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection models are widely applied in the fields such as video surveillance and unmanned aerial vehicles to enable the identification and monitoring of various objects on a diversity of backgrounds. The general CNN-based object detectors primarily rely on downsampling and pooling operations, often struggling with small objects that have low resolution and failing to fully leverage contextual information that can differentiate objects from complex background. To address the problems, we propose a novel YOLO framework called SF-YOLO for small object detection. Firstly, we present a spatial information perception (SIP) module to extract contextual features for different objects through the integration of space to depth operation and large selective kernel module, which dynamically adjusts receptive field of the backbone and obtains the enhanced features for richer understanding of differentiation between objects and background. Furthermore, we design a novel multi-scale feature weighted fusion strategy, which performs weighted fusion on feature maps by combining fast normalized fusion method and CARAFE operation, accurately assessing the importance of each feature and enhancing the representation of small objects. The extensive experiments conducted on VisDrone2019, Tiny-Person and PESMOD datasets demonstrate that our proposed method enables comparable detection performance to state-of-the-art detectors.
引用
收藏
页数:14
相关论文
共 60 条
  • [1] Cai Y., Luan T., Gao H., Et al., Yolov4-5d: An Effective and Efficient Object Detector for Autonomous Driving, IEEE Transactions on Instrumentation and Measurement, 70, pp. 1-13, (2021)
  • [2] Zhang Z., Drone-Yolo: An Efficient Neural Network Method for Target Detection in Drone Images, Drones, 7, 8, (2023)
  • [3] Wang J., Bai L., Fang Z., Han R., Wang J., Choi J., Age of Information Based URLLC Transmission for UAVs on Pylon Turn, IEEE Transactions on Vehicular Technology, 73, 6, pp. 8797-8809, (2024)
  • [4] Sun B., Song J., Wei M., 3D Trajectory Planning Model of Unmanned Aerial Vehicles (UAVs) in a Dynamic Complex Environment based on An Improved Ant Colony Optimization Algorithm, Journal of Nonlinear Convex Analysis, 25, pp. 737-746, (2024)
  • [5] Jin W., Tian X., Shi B., Zhao B., Duan H., Wu H., Enhanced UAV Pursuit-evasion Using Boids Modelling: A Synergistic Integration of Bird Swarm Intelligence and DRL, Computers, Materials & Continua, 80, 3, (2024)
  • [6] Yu W., Yang T., Chen C., Towards Resolving the Challenge of Long-Tail Distribution in UAV Images for Object Detection, pp. 3258-3267, (2021)
  • [7] Yin Y., Wang Z., Zheng L., Su Q., Guo Y., Autonomous UAV Navigation with Adaptive Control Based on Deep Reinforcement Learning, Electronics, 13, 13, (2024)
  • [8] Gao N., Zeng Y., Wang J., Et al., Energy Model for UAV Communications: Experimental Validation and Model Generalization, China Communications, 18, 7, pp. 253-264, (2021)
  • [9] Wu H., Hua Y., Zou H., Ke G., A Lightweight Network for Vehicle Detection Based on Embedded System, Journal of Supercomputing, 78, 16, pp. 18209-18224, (2022)
  • [10] Zhu Y., Zhang C., Zhou D., Wang X., Bai X., Liu W., Traffic Sign Detection and Recognition Using Fully Convolutional Network Guided Proposals, Neurocomputing, 214, pp. 758-766, (2016)