Multi-Modal System for Walking Safety for the Visually Impaired: Multi-Object Detection and Natural Language Generation

被引:0
作者
Lee, Jekyung [1 ]
Cha, Kyung-Ae [1 ]
Lee, Miran [2 ]
机构
[1] Daegu Univ, Dept Artificial Intelligence, Gyongsan 38453, South Korea
[2] Daegu Univ, Dept Comp & Informat Engn, Gyongsan 38453, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 17期
基金
新加坡国家研究基金会;
关键词
visually impaired; object detection; YOLOv5; natural language generation; KoAlpaca; walking assistance sentence;
D O I
10.3390/app14177643
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This study introduces a system for visually impaired individuals in a walking environment. It combines object recognition using YOLOv5 and cautionary sentence generation with KoAlpaca. The system employs image data augmentation for diverse training data and GPT for natural language training. Furthermore, the implementation of the system on a single board was followed by a comprehensive comparative analysis with existing studies. Moreover, a pilot test involving visually impaired and healthy individuals was conducted to validate the system's practical applicability and adaptability in real-world walking environments. Our pilot test results indicated an average usability score of 4.05. Participants expressed some dissatisfaction with the notification conveying time and online implementation, but they highly praised the system's object detection range and accuracy. The experiments demonstrated that using QLoRA enables more efficient training of larger models, which is associated with improved model performance. Our study makes a significant contribution to the literature because the proposed system enables real-time monitoring of various environmental conditions and objects in pedestrian environments using AI.
引用
收藏
页数:20
相关论文
共 49 条
  • [1] [Anonymous], 2022, About us
  • [2] Ayes, 2023, OKO App Leverages AI to Help Blind Pedestrians Recognize Traffic Signals
  • [3] Enhancing perception for the visually impaired with deep learning techniques and low-cost wearable sensors
    Bauer, Zuria
    Dominguez, Alejandro
    Cruz, Edmanuel
    Gomez-Donoso, Francisco
    Orts-Escolano, Sergio
    Cazorla, Miguel
    [J]. PATTERN RECOGNITION LETTERS, 2020, 137 : 27 - 36
  • [4] Be My Eyes, 2023, Introducing Be My AI
  • [5] A review of object detection: Datasets, performance evaluation, architecture, applications and current trends
    Chen, Wei
    Luo, Jinjin
    Zhang, Fan
    Tian, Zijian
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 65603 - 65661
  • [6] A wearable assistive system for the visually impaired using object detection, distance measurement and tactile presentation
    Chen, Yiwen
    Shen, Junjie
    Sawada, Hideyuki
    [J]. INTELLIGENCE & ROBOTICS, 2023, 3 (03): : 420 - 435
  • [7] Dettmers T, 2023, Arxiv, DOI arXiv:2305.14314
  • [8] Eckert M., 2018, P 11 INT JOINT C BIO, P555, DOI [10.5220/0006655605550561, DOI 10.5220/0006655605550561]
  • [9] A Systematic Review of Urban Navigation Systems for Visually Impaired People
    El-taher, Fatma El-zahraa
    Taha, Ayman
    Courtney, Jane
    Mckeever, Susan
    [J]. SENSORS, 2021, 21 (09)
  • [10] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587