Semaphore Recognition Using Deep Learning

被引:0
作者
Huan, Yan [1 ]
Yan, Weiqi [1 ]
机构
[1] Auckland Univ Technol, Dept Comp Sci, Auckland 1010, New Zealand
来源
ELECTRONICS | 2025年 / 14卷 / 02期
关键词
YOLO11; semaphore recognition; convolutional neural network (CNN); deep learning; MediaPipe; feature extraction; data enhancement; pre-training model;
D O I
10.3390/electronics14020286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explored the application of deep learning models for signal flag recognition, comparing YOLO11 with basic CNN, ResNet18, and DenseNet121. Experimental results demonstrated that YOLO11 outperformed the other models, achieving superior performance across all common evaluation metrics. The confusion matrix further confirmed that YOLO11 exhibited the highest classification accuracy among the tested models. Moreover, by integrating MediaPipe's human posture data with image data to create multimodal inputs for training, it was observed that the posture data significantly enhanced the model's performance. Leveraging MediaPipe's posture data for annotation generation and model training enabled YOLO11 to achieve an impressive 99% accuracy on the test set. This study highlights the effectiveness of YOLO11 for flag signal recognition tasks. Furthermore, it demonstrates that when handling tasks involving human posture, MediaPipe not only enhances model performance through posture feature data but also facilitates data processing and contributes to validating prediction results in subsequent stages.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Arabic Sign Language Recognition Using Deep Learning Models
    Al-Barham, Muhammad
    Abu Sa'aleek, Ahmad
    Al-Odat, Mohammad
    Hamad, Ghada
    Al-Yaman, Musa
    Elnagar, Ashraf
    2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 226 - 231
  • [22] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Kittisak Jermsittiparsert
    Abdurrahman Abdurrahman
    Parinya Siriattakul
    Ludmila A. Sundeeva
    Wahidah Hashim
    Robbi Rahim
    Andino Maseleno
    International Journal of Speech Technology, 2020, 23 : 799 - 806
  • [23] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Jermsittiparsert, Kittisak
    Abdurrahman, Abdurrahman
    Siriattakul, Parinya
    Sundeeva, Ludmila A.
    Hashim, Wahidah
    Rahim, Robbi
    Maseleno, Andino
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04) : 799 - 806
  • [24] Fuzzified Image Enhancement for Deep Learning in Iris Recognition
    Liu, Ming
    Zhou, Zhiqian
    Shang, Penghui
    Xu, Dong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (01) : 92 - 99
  • [25] A Survey of Biometric Recognition Using Deep Learning
    Mehraj H.
    Mir A.H.
    EAI Endorsed Transactions on Energy Web, 2021, 8 (33) : 1 - 16
  • [26] Voice Gender Recognition Using Deep Learning
    Buyukyilmaz, Mucahit
    Cibikdiken, Ali Osman
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND OPTIMIZATION TECHNOLOGIES AND APPLICATIONS (MSOTA2016), 2016, 58 : 409 - 411
  • [27] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [28] Biometrics recognition using deep learning: a survey
    Shervin Minaee
    Amirali Abdolrashidi
    Hang Su
    Mohammed Bennamoun
    David Zhang
    Artificial Intelligence Review, 2023, 56 : 8647 - 8695
  • [29] Traffic sign recognition using deep learning
    Patel V.
    Mehta J.
    Iyer S.
    Sharma A.K.
    International Journal of Vehicle Autonomous Systems, 2023, 16 (2-4) : 97 - 107
  • [30] Biometrics recognition using deep learning: a survey
    Minaee, Shervin
    Abdolrashidi, Amirali
    Su, Hang
    Bennamoun, Mohammed
    Zhang, David
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) : 8647 - 8695