Automated efficient traffic gesture recognition using swin transformer-based multi-input deep network with radar images

被引:0
作者
Firat, Huseyin [1 ]
Uzen, Huseyin [2 ]
Atila, Orhan [3 ]
Sengur, Abdulkadir [4 ]
机构
[1] Dicle Univ, Fac Engn, Dept Comp Engn, Diyarbakir, Turkiye
[2] Bingol Univ, Fac Engn & Architecture, Dept Comp Engn, Bingol, Turkiye
[3] Firat Univ, Technol Fac, Elect Elect Engn Dept, Elazig, Turkiye
[4] Firat Univ, Fac Technol, Dept Elect & Elect Engn, Elazig, Turkiye
关键词
Deep learning; Radar images; Swin transformers; Traffic hand gesture;
D O I
10.1007/s11760-024-03664-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Radar-based artificial intelligence (AI) applications have gained significant attention recently, spanning from fall detection to gesture recognition. The growing interest in this field has led to a shift towards deep convolutional networks, and transformers have emerged to address limitations in convolutional neural network methods, becoming increasingly popular in the AI community. In this paper, we present a novel hybrid approach for radar-based traffic hand gesture classification using transformers. Traffic hand gesture recognition (HGR) holds importance in AI applications, and our proposed three-phase approach addresses the efficiency and effectiveness of traffic HGR. In the initial phase, feature vectors are extracted from input radar images using the pre-trained DenseNet-121 model. These features are then consolidated by concatenating them to gather information from diverse radar sensors, followed by a patch extraction operation. The concatenated features from all inputs are processed in the Swin transformer block to facilitate further HGR. The classification stage involves sequential application of global average pooling, Dense, and Softmax layers. To assess the effectiveness of our method on ULM university radar dataset, we employ various performance metrics, including accuracy, precision, recall, and F1-score, achieving an average accuracy score of 90.54%. We compare this score with existing approaches to demonstrate the competitiveness of our proposed method.
引用
收藏
页数:11
相关论文
共 50 条
[31]   Identification of Asymptomatic COVID-19 Patients on Chest CT Images Using Transformer-Based or Convolutional Neural Network-Based Deep Learning Models [J].
Yin, Minyue ;
Liang, Xiaolong ;
Wang, Zilan ;
Zhou, Yijia ;
He, Yu ;
Xue, Yuhan ;
Gao, Jingwen ;
Lin, Jiaxi ;
Yu, Chenyan ;
Liu, Lu ;
Liu, Xiaolin ;
Xu, Chao ;
Zhu, Jinzhou .
JOURNAL OF DIGITAL IMAGING, 2023, 36 (03) :827-836
[32]   Employing a Multi-Input Deep Convolutional Neural Network to Derive Soil Clay Content from a Synergy of Multi-Temporal Optical and Radar Imagery Data [J].
Tziolas, Nikolaos ;
Tsakiridis, Nikolaos ;
Ben-Dor, Eyal ;
Theocharis, John ;
Zalidis, George .
REMOTE SENSING, 2020, 12 (09)
[33]   MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer [J].
Yuan, Wei ;
Xu, Wenbo .
REMOTE SENSING, 2021, 13 (23)
[34]   Efficient Ensemble via Rotation-Based Self- Supervised Learning Technique and Multi-Input Multi-Output Network [J].
Park, Jaehoon .
IEEE ACCESS, 2024, 12 :36135-36147
[35]   Multi-View Fusion Network-Based Gesture Recognition Using sEMG Data [J].
Li, Gongfa ;
Zou, Cejing ;
Jiang, Guozhang ;
Jiang, Du ;
Yun, Juntong ;
Zhao, Guojun ;
Cheng, Yangwei .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (08) :4432-4443
[36]   Gesture recognition from RGB images using convolutional neural network-attention based system [J].
Barbhuiya, Abul Abbas ;
Karsh, Ram Kumar ;
Jain, Rahul .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (24)
[37]   End-to-End Asbestos Roof Detection on Orthophotos Using Transformer-Based YOLO Deep Neural Network [J].
Pace, Cesare Davide ;
Bria, Alessandro ;
Focareta, Mariano ;
Lozupone, Gabriele ;
Marrocco, Claudio ;
Meoli, Giuseppe ;
Molinara, Mario .
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 :232-244
[38]   Development of a transformer-based deep learning algorithm for diabetic peripheral neuropathy classification using corneal confocal microscopy images [J].
Chen, Wenqu ;
Liao, Danling ;
Deng, Yuyang ;
Hu, Jianzhang .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2024, 12
[39]   Dynamic Hand Gesture Recognition Using Effective Feature Extraction and Attention Based Deep Neural Network [J].
Miah, Abu Saleh Musa ;
Shin, Jungpil ;
Hasan, Md. Al Mehedi ;
Okuyama, Yuichi ;
Nobuyoshi, Asai .
2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, :241-247
[40]   A Deep Learning Approach for Crop Disease and Pest Classification Using Swin Transformer and Dual-Attention Multi-Scale Fusion Network [J].
Karthik, R. ;
Ajay, Armaano ;
Singh Bisht, Akshaj ;
Illakiya, T. ;
Suganthi, K. .
IEEE ACCESS, 2024, 12 :152639-152655