USD-YOLO: An Enhanced YOLO Algorithm for Small Object Detection in Unmanned Systems Perception

被引:1
作者
Deng, Hongqiang [1 ]
Zhang, Shuzhe [2 ]
Wang, Xiaodong [1 ]
Han, Tianxin [1 ]
Ye, Yun [3 ]
机构
[1] Northeastern Univ, Coll Mech Engn & Automat, Shenyang 110819, Peoples R China
[2] Dalian Maritime Univ, Houston Int Inst, Dalian 116026, Peoples R China
[3] Ningbo Univ, Fac Maritime & Transportat, Ningbo 315211, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 07期
基金
中国国家自然科学基金;
关键词
computer vision; object detection; you only look once; deep learning; unmanned system; VEHICLE; RADAR;
D O I
10.3390/app15073795
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the perception of unmanned systems, small object detection faces numerous challenges, including small size, low resolution, dense distribution, and occlusion, leading to suboptimal perception performance. To address these issues, we propose a specialized algorithm named Unmanned-system Small-object Detection-You Only Look Once (USD-YOLO). First, we designed an innovative module called the Anchor-Free Precision Enhancer to achieve more accurate bounding box overlap measurements and provide a smarter processing mechanism, thereby improving the localization accuracy of candidate boxes for small and densely distributed objects. Second, we introduced the Spatial and Channel Reconstruction Convolution module to reduce redundancy in spatial and channel features while extracting key features of small objects. Additionally, we designed a novel C2f-Global Attention Mechanism module to expand the receptive field and capture more contextual information, optimizing the detection head's ability to handle small and low-resolution objects. We conducted extensive experimental comparisons with state-of-the-art models on three mainstream unmanned system datasets and a real unmanned ground vehicle. The experimental results demonstrate that USD-YOLO achieves higher detection precision and faster speed. On the Citypersons dataset, compared with the baseline, USD-YOLO improves mAP50-95, mAP50, and Recall by 8.5%, 5.9%, and 2.3%, respectively. Additionally, on the Flow-Img and DOTA-v1.0 datasets, USD-YOLO improves mAP50-95 by 2.5% and 2.5%, respectively.
引用
收藏
页数:19
相关论文
共 42 条
[1]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[2]   MODS--A USV-Oriented Object Detection and Obstacle Segmentation Benchmark [J].
Bovcon, Borja ;
Muhovic, Jon ;
Vranac, Dusko ;
Mozetic, Dean ;
Pers, Janez ;
Kristan, Matej .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :13403-13418
[3]   Deep Neural Network Based Vehicle and Pedestrian Detection for Autonomous Driving: A Survey [J].
Chen, Long ;
Lin, Shaobo ;
Lu, Xiankai ;
Cao, Dongpu ;
Wu, Hangbin ;
Guo, Chi ;
Liu, Chun ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) :3234-3246
[4]   Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].
Cheng, Gong ;
Yuan, Xiang ;
Yao, Xiwen ;
Yan, Kebing ;
Zeng, Qinghua ;
Xie, Xingxing ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488
[5]   Feature Enhancement Network for Object Detection in Optical Remote Sensing Images [J].
Cheng, Gong ;
Lang, Chunbo ;
Wu, Maoxiong ;
Xie, Xingxing ;
Yao, Xiwen ;
Han, Junwei .
JOURNAL OF REMOTE SENSING, 2021, 2021
[6]   FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters [J].
Cheng, Yuwei ;
Zhu, Jiannan ;
Jiang, Mengxin ;
Fu, Jie ;
Pang, Changsong ;
Wang, Peidong ;
Sankaran, Kris ;
Onabola, Olawale ;
Liu, Yimin ;
Liu, Dianbo ;
Bengio, Yoshua .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :10933-10942
[7]   YOLO-Former: Marrying YOLO and Transformer for Foreign Object Detection [J].
Dai, Yuan ;
Liu, Weiming ;
Wang, Heng ;
Xie, Wei ;
Long, Kejun .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[8]   NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].
Ghiasi, Golnaz ;
Lin, Tsung-Yi ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Millimeter-Wave Radar and Camera Fusion for Multiscenario Object Detection on USVs [J].
He, Xin ;
Wu, Defeng ;
Wu, Dongjie ;
You, Zheng ;
Zhong, Shangkun ;
Liu, Qijun .
IEEE SENSORS JOURNAL, 2024, 24 (19) :31562-31572