L-SSD: lightweight SSD target detection based on depth-separable convolution

被引：9

作者：

Wang, Huilin ^{[1
]}

Qian, Huaming ^{[1
]}

Feng, Shuai ^{[1
]}

Wang, Wenna ^{[1
]}

机构：

[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China

来源：

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2024年 / 21卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Asymmetric spatial attention; Feature fusion; Improved BiFPN; Local-global feature extraction; Lightweight target detection; MobileNetv2;

D O I：

10.1007/s11554-024-01413-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The current target detection algorithm based on deep learning has many redundant convolution calculations, which are difficult to apply to low-energy mobile devices, such as intelligent inspection robots and automatic driving. To solve this problem, we propose a lightweight target detection algorithm, L-SSD, based on depth-separable convolution. First, we chose the lightweight network MobileNetv2 as the backbone feature extraction network, and we proposed an upsampling feature fusion module (UFFM) to fuse the output feature maps of MobileNetv2. Deep semantic information is introduced into the shallow feature map to improve the feature extraction capability while reducing the complexity of the model. Second, we propose a local-global feature extraction module (LGFEM), which uses LGFEM to generate five additional feature layers to expand the feature map's receptive field and improve the model's detection accuracy. Then, we use an improved weighted bidirectional feature pyramid (BiFPN) for feature fusion to construct a new feature pyramid that fully utilizes the feature information between different layers. Finally, we propose asymmetric spatial attention (ASA) that enhances the expression ability of the features before BiFPN feature fusion, providing good positional information for the feature pyramid. Experimental results on the PASCAL VOC and MS COCO datasets show that the model parameters and model complexity of L-SSD are reduced by 85.9% and 96.1%, respectively, compared to SSD. A detection speed of 106 frames per second was achieved in NVIDIA GeForce RTX 3060 with detection accuracies of 73.8% and 22.4%, respectively. The optimal balance of model parameters, model complexity, detection accuracy, and speed are achieved.

引用

页数：15

共 50 条

[1] L-SSD: lightweight SSD target detection based on depth-separable convolution
Huilin Wang
Huaming Qian
Shuai Feng
Wenna Wang
Journal of Real-Time Image Processing, 2024, 21
[2] Lightweight Bridge Crack Detection Method Based on SegNet and Bottleneck Depth-Separable Convolution With Residuals
Zheng, Xuan
Zhang, Shuailong
Li, Xue
Li, Gang
Li, Xiyuan
IEEE ACCESS, 2021, 9 : 161649 - 161668
[3] Research on Lightweight Method of Insulator Target Detection Based on Improved SSD
Zeng, Bing
Zhou, Yu
He, Dilin
Zhou, Zhihao
Hao, Shitao
Yi, Kexin
Li, Zhilong
Zhang, Wenhua
Xie, Yunmin
SENSORS, 2024, 24 (18)
[4] A Mini-UAV Lightweight Target Detection Model Based on SSD
Zhang, JiaHui
Xie, RongLei
Meng, ZhiJun
Li, Gen
Xin, ShuLin
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2999 - 3013
[5] Ssd-kdgan: a lightweight SSD target detection method based on knowledge distillation and generative adversarial networks
Wang, Huilin
Qian, Huaming
Feng, Shuai
JOURNAL OF SUPERCOMPUTING, 2024, 80 (16): : 23544 - 23564
[6] CAL-SSD: lightweight SSD object detection based on coordinated attention
Zhong, Xin
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[7] The Multi-Scale Depth-Separable Convolution Network for Fire and Smoke Detection
Yan, Huihui
Cui, Zhihua
Zhao, Haotian
Zhang, Jingbo
Qin, Juanjuan
Guo, Qian
COMBUSTION SCIENCE AND TECHNOLOGY, 2024,
[8] Traffic Pedestrian Detection Algorithm based on Lightweight SSD
Huang, JiaBao
Cai, Qiong
Chen, Yu
Huang, QianQian
Li, Fang
THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167
[9] Compression algorithm for live face recognition model based on depth-separable convolution
Wei, Jinhu
Zhou, Yan
Xie, Wei
Yu, JinWei
Li, WeiSheng
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1624 - 1628
[10] FD-SSD: An improved SSD object detection algorithm based on feature fusion and dilated convolution
Yin, Qunjie
Yang, Wenzhu
Ran, Mengying
Wang, Sile
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 98

← 1 2 3 4 5 →