Analysis and design framework for the development of indoor scene understanding assistive solutions for the person with visual impairment/blindness

被引:2
作者
Valipoor, Moeen [1 ]
de Antonio, Angelica [1 ]
Cabrera, Julian [2 ]
机构
[1] Univ Politecn Madrid, Madrid HCI Lab, ETS Ingn Informat, Madrid, Spain
[2] Univ Politecn Madrid, Informat Proc & Telecommun Ctr, Grp Tratamiento Imagenes, ETSI Telecomunicac, Madrid, Spain
关键词
Assistive technology; Blindness; Visual impairment; Scene understanding; BLIND PEOPLE; OBJECT DETECTION; NAVIGATION; GUIDE;
D O I
10.1007/s00530-024-01350-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper discusses the challenges of the current state of computer vision-based indoor scene understanding assistive solutions for the person with visual impairment (P-VI)/blindness. It focuses on two main issues: the lack of user-centered approach in the development process and the lack of guidelines for the selection of appropriate technologies. First, it discusses the needs of users of an assistive solution through state-of-the-art analysis based on a previous systematic review of literature and commercial products and on semi-structured user interviews. Then it proposes an analysis and design framework to address these needs. Our paper presents a set of structured use cases that help to visualize and categorize the diverse real-world challenges faced by the P-VI/blindness in indoor settings, including scene description, object finding, color detection, obstacle avoidance and text reading across different contexts. Next, it details the functional and non-functional requirements to be fulfilled by indoor scene understanding assistive solutions and provides a reference architecture that helps to map the needs into solutions, identifying the components that are necessary to cover the different use cases and respond to the requirements. To further guide the development of the architecture components, the paper offers insights into various available technologies like depth cameras, object detection, segmentation algorithms and optical character recognition (OCR), to enable an informed selection of the most suitable technologies for the development of specific assistive solutions, based on aspects like effectiveness, price and computational cost. In conclusion, by systematically analyzing user needs and providing guidelines for technology selection, this research contributes to the development of more personalized and practical assistive solutions tailored to the unique challenges faced by the P-VI/blindness.
引用
收藏
页数:28
相关论文
共 113 条
[1]  
Abraham Leo, 2020, 2020 4th International Conference on Trends in Electronics and Informatics (ICOEI). Proceedings, P972, DOI 10.1109/ICOEI48184.2020.9142984
[2]   A novel algorithm for distance measurement using stereo camera [J].
Adil, Elmehdi ;
Mikou, Mohammed ;
Mouhsen, Ahmed .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (02) :177-186
[3]  
Aira, 2023, About us
[4]  
Akter Taslima, 2020, CSCW '20: 23rd Conference on Computer-Supported Cooperative Work and Social Computing, P69, DOI 10.1145/3406865.3418382
[5]  
Alamri Abdullah, 2023, 2023 9th International Conference on Engineering, Applied Sciences, and Technology (ICEAST), P38, DOI 10.1109/ICEAST58324.2023.10157934
[6]  
Allam Mahmoud, 2022, Digital Transformation Technology: Proceedings of ITAF 2020. Lecture Notes in Networks and Systems (224), P195, DOI 10.1007/978-981-16-2275-5_12
[7]  
[Anonymous], 2024, Detectron2 Model Zoo and Baselines
[8]  
[Anonymous], 2022, Unsupervised monocular depth estimation in highly complex environments
[9]  
[Anonymous], 2024, Realsense L515
[10]  
[Anonymous], 2007, Apple unveils ARKit 2