Semaphore Recognition Using Deep Learning

被引:1
作者
Huan, Yan [1 ]
Yan, Weiqi [1 ]
机构
[1] Auckland Univ Technol, Dept Comp Sci, Auckland 1010, New Zealand
关键词
YOLO11; semaphore recognition; convolutional neural network (CNN); deep learning; MediaPipe; feature extraction; data enhancement; pre-training model;
D O I
10.3390/electronics14020286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explored the application of deep learning models for signal flag recognition, comparing YOLO11 with basic CNN, ResNet18, and DenseNet121. Experimental results demonstrated that YOLO11 outperformed the other models, achieving superior performance across all common evaluation metrics. The confusion matrix further confirmed that YOLO11 exhibited the highest classification accuracy among the tested models. Moreover, by integrating MediaPipe's human posture data with image data to create multimodal inputs for training, it was observed that the posture data significantly enhanced the model's performance. Leveraging MediaPipe's posture data for annotation generation and model training enabled YOLO11 to achieve an impressive 99% accuracy on the test set. This study highlights the effectiveness of YOLO11 for flag signal recognition tasks. Furthermore, it demonstrates that when handling tasks involving human posture, MediaPipe not only enhances model performance through posture feature data but also facilitates data processing and contributes to validating prediction results in subsequent stages.
引用
收藏
页数:19
相关论文
共 50 条
[41]   A Survey of Biometric Recognition Using Deep Learning [J].
Mehraj H. ;
Mir A.H. .
EAI Endorsed Transactions on Energy Web, 2021, 8 (33) :1-16
[42]   Recognition of driver distractions using deep learning [J].
Valeriano, Leonel Cuevas ;
Napoletano, Paolo ;
Schettini, Raimondo .
2018 IEEE 8TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2018,
[43]   Fake Speech Recognition Using Deep Learning [J].
Camacho, Steven ;
Maria Ballesteros, Dora ;
Renza, Diego .
APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021, 2021, 1431 :38-48
[44]   Food State Recognition Using Deep Learning [J].
Alahmari, Saeed. S. S. ;
Salem, Tawfiq .
IEEE ACCESS, 2022, 10 :130048-130057
[45]   Overview of currency recognition using deep learning [J].
Qian Zhang ;
Wei Qi Yan ;
Mohan Kankanhalli .
Journal of Banking and Financial Technology, 2019, 3 (1) :59-69
[46]   Handwritten Text Recognition using Deep Learning [J].
Nikitha, A. ;
Geetha, J. ;
JayaLakshmi, D. S. .
2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS ON ELECTRONICS, INFORMATION, COMMUNICATION & TECHNOLOGY (RTEICT-2020), 2020, :388-392
[47]   Jordanian Currency Recognition Using Deep Learning [J].
Alghyaline, Salah .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) :397-404
[48]   License plate segmentation and recognition system using deep learning and OpenVINO [J].
Castro-Zunti, Riel D. ;
Yepez, Juan ;
Ko, Seok-Bum .
IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (02) :119-126
[49]   Comparison Study of Traffic Signs Recognition Using Deep Learning Architectures [J].
Alawaji, Khaldaa ;
Hedjar, Ramdane .
2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, :442-447
[50]   Data Augmentation and Intelligent Recognition in Pavement Texture Using a Deep Learning [J].
Chen, Ning ;
Xu, Zijin ;
Liu, Zhuo ;
Chen, Yihan ;
Miao, Yinghao ;
Li, Qiuhan ;
Hou, Yue ;
Wang, Linbing .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) :25427-25436