DSOD-YOLO: A lightweight dual feature extraction method for small target detection

被引:1
作者
Nie, Yuan [1 ]
Lai, Huicheng [2 ]
Gao, Guxue [2 ]
机构
[1] Xinjiang Univ, Coll Comp Sci & Technol, Urumqi 830017, Peoples R China
[2] Xinjiang Univ, Key Lab Signal Detect & Proc, Xinjiang Uygur Autonomous Reg, Urumqi 830017, Peoples R China
关键词
Lightweight small-object detection; Dual-backbone feature extraction; Pruning; Knowledge distillation; SmallDark; OBJECT; NETWORK;
D O I
10.1016/j.dsp.2025.105268
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSODYOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.
引用
收藏
页数:15
相关论文
共 71 条
[1]   Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap [J].
Amirian, Soheyla ;
Rasheed, Khaled ;
Taha, Thiab R. ;
Arabnia, Hamid R. .
IEEE ACCESS, 2020, 8 :218386-218400
[2]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3]   Deep Learning on Computational-Resource-Limited Platforms: A Survey [J].
Chen, Chunlei ;
Zhang, Peng ;
Zhang, Huixiang ;
Dai, Jiangyan ;
Yi, Yugen ;
Zhang, Huihui ;
Zhang, Yonghui .
MOBILE INFORMATION SYSTEMS, 2020, 2020
[4]   Image inpainting algorithm based on inference attention module and two-stage network [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
[5]   DNNAM: Image inpainting algorithm via deep neural networks and attention mechanism [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
APPLIED SOFT COMPUTING, 2024, 154
[6]   MFMAM: Image inpainting via multi-scale feature module with attention module [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
[7]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28
[8]   A survey of detection-based video multi-object tracking [J].
Dai, Yan ;
Hu, Ziyu ;
Zhang, Shuqi ;
Liu, Lianjun .
DISPLAYS, 2022, 75
[9]   Extended Feature Pyramid Network for Small Object Detection [J].
Deng, Chunfang ;
Wang, Mengmeng ;
Liu, Liang ;
Liu, Yong ;
Jiang, Yunliang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1968-1979
[10]   A detection network for small defects of steel surface based on YOLOv7 [J].
Gao, Shaoshu ;
Chu, Menghui ;
Zhang, Long .
DIGITAL SIGNAL PROCESSING, 2024, 149