Semaphore Recognition Using Deep Learning

被引:1
作者
Huan, Yan [1 ]
Yan, Weiqi [1 ]
机构
[1] Auckland Univ Technol, Dept Comp Sci, Auckland 1010, New Zealand
关键词
YOLO11; semaphore recognition; convolutional neural network (CNN); deep learning; MediaPipe; feature extraction; data enhancement; pre-training model;
D O I
10.3390/electronics14020286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explored the application of deep learning models for signal flag recognition, comparing YOLO11 with basic CNN, ResNet18, and DenseNet121. Experimental results demonstrated that YOLO11 outperformed the other models, achieving superior performance across all common evaluation metrics. The confusion matrix further confirmed that YOLO11 exhibited the highest classification accuracy among the tested models. Moreover, by integrating MediaPipe's human posture data with image data to create multimodal inputs for training, it was observed that the posture data significantly enhanced the model's performance. Leveraging MediaPipe's posture data for annotation generation and model training enabled YOLO11 to achieve an impressive 99% accuracy on the test set. This study highlights the effectiveness of YOLO11 for flag signal recognition tasks. Furthermore, it demonstrates that when handling tasks involving human posture, MediaPipe not only enhances model performance through posture feature data but also facilitates data processing and contributes to validating prediction results in subsequent stages.
引用
收藏
页数:19
相关论文
共 50 条
[1]   EMOTION RECOGNITION USING DEEP LEARNING [J].
Priya, R. N. Beena ;
Hanmandlu, M. ;
Vasikarla, Shantaram .
2021 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2021,
[2]   Sign Language Recognition Using Deep Learning [J].
Ray, Anushka ;
Syed, Shahbaz ;
Poornima, S. ;
Pushpalatha, M. .
JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 :421-428
[3]   Arabic Handwritten Recognition Using Deep Learning: A Survey [J].
Alrobah, Naseem ;
Albahli, Saleh .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) :9943-9963
[4]   Arabic Handwritten Recognition Using Deep Learning: A Survey [J].
Naseem Alrobah ;
Saleh Albahli .
Arabian Journal for Science and Engineering, 2022, 47 :9943-9963
[5]   Blind Interleaver Recognition Using Deep Learning Techniques [J].
Ahamed, Nayim ;
Swaminathan, R. ;
Naveen, B. .
IEEE ACCESS, 2024, 12 :158714-158730
[6]   Durian Types Recognition Using Deep Learning Techniques [J].
Lim, Marcus Guozong ;
Chuah, Joon Huang .
2018 9TH IEEE CONTROL AND SYSTEM GRADUATE RESEARCH COLLOQUIUM (ICSGRC2018), 2018, :183-187
[7]   Speech Recognition using Deep Learning [J].
Lakkhanawannakun, Phoemporn ;
Noyunsan, Chaluemwut .
2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, :514-517
[8]   Sketch recognition using deep learning [J].
Zhao P. ;
Wang F. ;
Liu H. ;
Yao S. .
Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2016, 48 (03) :94-99
[9]   Deep Learning for Vein Biometric Recognition on a Smartphone [J].
Garcia-Martin, Raul ;
Sanchez-Reillo, Raul .
IEEE ACCESS, 2021, 9 :98812-98832
[10]   Recognition of cursive video text using a deep learning framework [J].
Mirza, Ali ;
Siddiqi, Imran .
IET IMAGE PROCESSING, 2020, 14 (14) :3444-3455