Swin-YOLO for Concealed Object Detection in Millimeter Wave Images

被引:10
作者
Huang, Pingping [1 ,2 ]
Wei, Ran [1 ,2 ]
Su, Yun [1 ,2 ]
Tan, Weixian [1 ,2 ]
机构
[1] Inner Mongolia Univ Technol, Coll Informat Engn, Hohhot 010051, Peoples R China
[2] Inner Mongolia Key Lab Radar Technol & Applicat, Hohhot 010051, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
millimeter wave images; concealed object detection; Swin Transformer; attention mechanism; SEGMENTATION;
D O I
10.3390/app13179793
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Concealed object detection in millimeter wave (MMW) images has gained significant attention in the realm of public safety, primarily due to its distinctive advantages of non-hazardous and non-contact operation. However, this undertaking confronts substantial challenges in practical applications, owing to the inherent limitations of low imaging resolution, small concealed object size, intricate environmental noise, and the need for real-time performance. In this study, we propose Swin-YOLO, an innovative single-stage detection model built upon transformer layers. Our approach encompasses several key contributions. Firstly, the integration of Local Perception Swin Transform Layers (LPST Layers) enhanced the network's capability to acquire contextual information and local awareness. Secondly, we introduced a novel feature fusion layer and a specialized prediction head for detecting small targets, effectively leveraging the network's shallow feature information. Lastly, a coordinate attention (CA) module was seamlessly incorporated between the neck network and the detection head, augmenting the network's sensitivity towards critical regions of small objects. To validate the efficacy and feasibility of our proposed method, we created a new MMW dataset containing a large number of small concealed objects and conducted comprehensive experiments to evaluate the effectiveness of overall and partial improvements, as well as computational efficiency. The results demonstrated a remarkable 4.7% improvement in the mean Average Precision (mAP) for Swin-YOLO compared with the YOLOv5 baseline. Moreover, when compared with other enhanced transformer-based models, Swin-YOLO exhibited a superior accuracy and the fastest inference speed. The proposed model showcases enhanced performance and holds promise for advancing the capabilities of real-world applications in public safety domains.
引用
收藏
页数:23
相关论文
共 50 条
[41]   Non-local Neural Networks [J].
Wang, Xiaolong ;
Girshick, Ross ;
Gupta, Abhinav ;
He, Kaiming .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7794-7803
[42]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[43]   A novel deformable body partition model for MMW suspicious object detection and dynamic tracking [J].
Yang, Xi ;
Wei, Ziyu ;
Wang, Nannan ;
Song, Bin ;
Gao, Xinbo .
SIGNAL PROCESSING, 2020, 174
[44]   Concealed object recognition based on geometric feature descriptors [J].
Yeom, Seokwon ;
Lee, Dong-Su ;
Chang, YuShin ;
Lee, Mun-Kyo ;
Jung, Sang-Won .
PASSIVE AND ACTIVE MILLIMETER-WAVE IMAGING XV, 2012, 8362
[45]   A Suspicious Multi-Object Detection and Recognition Method for Millimeter Wave SAR Security Inspection Images Based on Multi-Path Extraction Network [J].
Yuan, Minghui ;
Zhang, Quansheng ;
Li, Yinwei ;
Yan, Yunhao ;
Zhu, Yiming .
REMOTE SENSING, 2021, 13 (24)
[46]   Domain adaptive detection system for concealed objects using millimeter wave images [J].
Zhang, Bo ;
Wang, Bin ;
Wu, Xiaofeng ;
Zhang, Liming ;
Yang, Minghui ;
Sun, Xiaowei .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (18) :11573-11588
[47]  
Zhang K.S., 2022, Masters Thesis
[48]   The Development of Frequency Multipliers for Terahertz Remote Sensing System [J].
Zhang, Yong ;
Wu, Chengkai ;
Liu, Xiaoyu ;
Wang, Li ;
Dai, Chunyue ;
Cui, Jianhang ;
Li, Yukun ;
Kinar, Nicholas .
REMOTE SENSING, 2022, 14 (10)
[49]  
Zheng ZH, 2020, AAAI CONF ARTIF INTE, V34, P12993
[50]   TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios [J].
Zhu, Xingkui ;
Lyu, Shuchang ;
Wang, Xu ;
Zhao, Qi .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :2778-2788