Embedded solution to detect and classify head level objects using stereo vision for visually impaired people with audio feedback

被引:0
作者
Kevin, Munoz [1 ]
Mario, Chavarria [2 ]
Ortiz, Luisa [3 ]
Sutter, Silvan [2 ]
Klaus, Schonenberger [2 ]
Bladimir, Bacca-Cortes [1 ]
机构
[1] Univ Valle, Sch Elect & Elect Engn, Cali, Colombia
[2] Swiss Fed Inst Technol Lausanne, EssentialTech, Lausanne, Switzerland
[3] Univ Autonoma Occident, Fac Engn & Basic Sci, Cali, Colombia
关键词
Audio feedback; Convolutional neural networks; Embedded systems; Head-level object detection; Visually impaired people; SYSTEM;
D O I
10.1038/s41598-025-01529-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This work presents an embedded solution for detecting and classifying head-level objects using stereo vision to assist blind individuals. A custom dataset was created, featuring five classes of head-level objects, selected based on a survey of visually impaired users. Object detection and classification were achieved using deep-neural networks such as YoloV5. The system computes the relative range and orientation of detected head-level objects and provides audio feedback to alert the user about nearby objects. Four types of tests were conducted: a dataset-based test, achieving a mAP@0.95 of 0.89 for head-level objects classification; a quantitative assessment of range and orientation, with an average error of 0.028 m +/- 0.004 and 2.05 degrees +/- 0.09, respectively; a field test conducted over a week at different times and lighting conditions, yielding a precision/recall of 98.21%/93.75% for head-level object classification; and user tests with Head-level identification accuracy of 91% and obstacle-avoidance/local-navigation where users reported an average of 88.75% for low or middle risk.
引用
收藏
页数:19
相关论文
共 33 条
[1]  
Achirei S. D., 2021, IEEE 17 INT C INT CO, P409, DOI [10.1109/ICCP53602.2021.9733610, DOI 10.1109/ICCP53602.2021.9733610]
[2]   Navigation Assistance for the Visually Impaired Using RGB-D Sensor With Range Expansion [J].
Aladren, A. ;
Lopez-Nicolas, G. ;
Puig, Luis ;
Guerrero, Josechu J. .
IEEE SYSTEMS JOURNAL, 2016, 10 (03) :922-932
[3]   CNN-Based Object Recognition and Tracking System to Assist Visually Impaired People [J].
Ashiq, Fahad ;
Asif, Muhammad ;
Bin Ahmad, Maaz ;
Zafar, Sadia ;
Masood, Khalid ;
Mahmood, Toqeer ;
Mahmood, Muhammad Tariq ;
Lee, Ik Hyun .
IEEE ACCESS, 2022, 10 :14819-14834
[4]  
Chavarria M.A., 2021, Accessible Technology and the Developing World, P248, DOI [10.1093/oso/9780198846413.003.0013, DOI 10.1093/OSO/9780198846413.003.0013]
[5]   A 68-mw 2.2 Tops/w Low Bit Width and Multiplierless DCNN Object Detection Processor for Visually Impaired People [J].
Chen, Xiaobai ;
Xu, Jinglong ;
Yu, Zhiyi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3444-3453
[6]  
Daewoong K., 2021, Detectonlampcabinetc & Dataset Roboflow Universe, V1
[7]  
developer.nvidia, 2022, NVIDIA & TensorRT NVIDIA Developer
[8]  
Durette P. N., 2021, Google Text to Speech
[9]  
Everding L, 2016, 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), P228
[10]  
Hsueh-Cheng Wang, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P6533, DOI 10.1109/ICRA.2017.7989772