Semaphore Recognition Using Deep Learning

被引:0
作者
Huan, Yan [1 ]
Yan, Weiqi [1 ]
机构
[1] Auckland Univ Technol, Dept Comp Sci, Auckland 1010, New Zealand
来源
ELECTRONICS | 2025年 / 14卷 / 02期
关键词
YOLO11; semaphore recognition; convolutional neural network (CNN); deep learning; MediaPipe; feature extraction; data enhancement; pre-training model;
D O I
10.3390/electronics14020286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explored the application of deep learning models for signal flag recognition, comparing YOLO11 with basic CNN, ResNet18, and DenseNet121. Experimental results demonstrated that YOLO11 outperformed the other models, achieving superior performance across all common evaluation metrics. The confusion matrix further confirmed that YOLO11 exhibited the highest classification accuracy among the tested models. Moreover, by integrating MediaPipe's human posture data with image data to create multimodal inputs for training, it was observed that the posture data significantly enhanced the model's performance. Leveraging MediaPipe's posture data for annotation generation and model training enabled YOLO11 to achieve an impressive 99% accuracy on the test set. This study highlights the effectiveness of YOLO11 for flag signal recognition tasks. Furthermore, it demonstrates that when handling tasks involving human posture, MediaPipe not only enhances model performance through posture feature data but also facilitates data processing and contributes to validating prediction results in subsequent stages.
引用
收藏
页数:19
相关论文
共 50 条
[31]   Speech Emotion Recognition Using Deep Learning [J].
Alagusundari, N. ;
Anuradha, R. .
ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 :313-325
[32]   Human Activity Recognition using Deep Learning [J].
Moola, Ramu ;
Hossain, Ashraf .
2022 URSI REGIONAL CONFERENCE ON RADIO SCIENCE, USRI-RCRS, 2022, :165-168
[33]   Fake Banknote Recognition Using Deep Learning [J].
Pachon, Cesar G. ;
Ballesteros, Dora M. ;
Renza, Diego .
APPLIED SCIENCES-BASEL, 2021, 11 (03) :1-20
[34]   Speech Command Recognition Using Deep Learning [J].
Ayache, Mohammad ;
Kanaan, Hussien ;
Kassir, Kawthar ;
Kassir, Yasser .
2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, :24-29
[35]   Fake Speech Recognition Using Deep Learning [J].
Camacho, Steven ;
Maria Ballesteros, Dora ;
Renza, Diego .
APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021, 2021, 1431 :38-48
[36]   Recognition of driver distractions using deep learning [J].
Valeriano, Leonel Cuevas ;
Napoletano, Paolo ;
Schettini, Raimondo .
2018 IEEE 8TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2018,
[37]   A Survey of Biometric Recognition Using Deep Learning [J].
Mehraj H. ;
Mir A.H. .
EAI Endorsed Transactions on Energy Web, 2021, 8 (33) :1-16
[38]   Detection and Recognition of Badgers Using Deep Learning [J].
Okafor, Emmanuel ;
Berendsen, Gerard ;
Schomaker, Lambert ;
Wiering, Marco .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 :554-563
[39]   Speech Emotion Recognition Using Deep Learning [J].
Ahmed, Waqar ;
Riaz, Sana ;
Iftikhar, Khunsa ;
Konur, Savas .
ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 :191-197
[40]   Hand Gesture Recognition Using Deep Learning [J].
Hussain, Soeb ;
Saxena, Rupal ;
Han, Xie ;
Khan, Jameel Ahmed ;
Shin, Hyunchul .
PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017, :48-49