Embedded solution to detect and classify head level objects using stereo vision for visually impaired people with audio feedback

被引：0

作者：

Kevin, Munoz ^{[1
]}

Mario, Chavarria ^{[2
]}

Ortiz, Luisa ^{[3
]}

Sutter, Silvan ^{[2
]}

Klaus, Schonenberger ^{[2
]}

Bladimir, Bacca-Cortes ^{[1
]}

机构：

[1] Univ Valle, Sch Elect & Elect Engn, Cali, Colombia

[2] Swiss Fed Inst Technol Lausanne, EssentialTech, Lausanne, Switzerland

[3] Univ Autonoma Occident, Fac Engn & Basic Sci, Cali, Colombia

来源：

SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期

关键词：

Audio feedback; Convolutional neural networks; Embedded systems; Head-level object detection; Visually impaired people; SYSTEM;

D O I：

10.1038/s41598-025-01529-7

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

This work presents an embedded solution for detecting and classifying head-level objects using stereo vision to assist blind individuals. A custom dataset was created, featuring five classes of head-level objects, selected based on a survey of visually impaired users. Object detection and classification were achieved using deep-neural networks such as YoloV5. The system computes the relative range and orientation of detected head-level objects and provides audio feedback to alert the user about nearby objects. Four types of tests were conducted: a dataset-based test, achieving a mAP@0.95 of 0.89 for head-level objects classification; a quantitative assessment of range and orientation, with an average error of 0.028 m +/- 0.004 and 2.05 degrees +/- 0.09, respectively; a field test conducted over a week at different times and lighting conditions, yielding a precision/recall of 98.21%/93.75% for head-level object classification; and user tests with Head-level identification accuracy of 91% and obstacle-avoidance/local-navigation where users reported an average of 88.75% for low or middle risk.

引用

页数：19

共 33 条

[1]

Achirei S. D., 2021, IEEE 17 INT C INT CO, P409, DOI [10.1109/ICCP53602.2021.9733610, DOI 10.1109/ICCP53602.2021.9733610]

[2] Navigation Assistance for the Visually Impaired Using RGB-D Sensor With Range Expansion [J].

Aladren, A. ;

Lopez-Nicolas, G. ;

Puig, Luis ;

Guerrero, Josechu J. .

IEEE SYSTEMS JOURNAL, 2016, 10 (03) :922-932

[3] CNN-Based Object Recognition and Tracking System to Assist Visually Impaired People [J].

Ashiq, Fahad ;

Asif, Muhammad ;

Bin Ahmad, Maaz ;

Zafar, Sadia ;

Masood, Khalid ;

Mahmood, Toqeer ;

Mahmood, Muhammad Tariq ;

Lee, Ik Hyun .

IEEE ACCESS, 2022, 10 :14819-14834

[4]

Chavarria M.A., 2021, Accessible Technology and the Developing World, P248, DOI [10.1093/oso/9780198846413.003.0013, DOI 10.1093/OSO/9780198846413.003.0013]

[5] A 68-mw 2.2 Tops/w Low Bit Width and Multiplierless DCNN Object Detection Processor for Visually Impaired People [J].

Chen, Xiaobai ;

Xu, Jinglong ;

Yu, Zhiyi .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3444-3453

[6]

Daewoong K., 2021, Detectonlampcabinetc & Dataset Roboflow Universe, V1

[7]

developer.nvidia, 2022, NVIDIA & TensorRT NVIDIA Developer

[8]

Durette P. N., 2021, Google Text to Speech

[9]

Everding L, 2016, 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), P228

[10]

Hsueh-Cheng Wang, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P6533, DOI 10.1109/ICRA.2017.7989772

← 1 2 3 4 →