Semantic scene understanding on mobile device with illumination invariance for the visually impaired

Cited by: 0

Authors:

Xu, Chengyou [1]

Wang, Kaiwei [1]

Yang, Kailun [1]

Cheng, Ruiqi [1]

Bai, Jian [1]

Affiliation:

[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019 / Vol. 11169

Keywords:

Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;

DOI:

10.1117/12.2532550

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Visually Impaired People (VIP) have great difficulty perceiving their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to changes in environment and illumination, including the change between indoor and outdoor environments and the change across different weather conditions. Meanwhile, most existing methods focus on either accuracy or efficiency rather than on a balance between the two. In the proposed system, the training dataset is preprocessed with an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. As for the network structure, lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve accuracy with little increase in computation, which makes the system suitable for mobile assistive devices. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset, and demonstrate that the illumination-invariant pre-transformation has a strongly positive effect in challenging real-world domains. The network, trained on a computer, achieves relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system reaches up to 83 FPS on a 1080Ti GPU.
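
The abstract does not say which illumination-invariant transformation is applied to the training data. As a minimal sketch only, assuming a log-chromaticity transform of the kind often used in the place-recognition literature, the preprocessing step might look like the following. The function name `illumination_invariant`, the parameter `alpha` (a camera-dependent constant; 0.48 is an illustrative value, not one taken from this paper), and the clamp `eps` are all assumptions made for illustration.

```python
import numpy as np


def illumination_invariant(rgb, alpha=0.48, eps=1e-6):
    """Map an (H, W, 3) RGB image with positive values to one channel.

    Hypothetical helper: a log-chromaticity combination of the channels.
    The weights on the log terms sum to zero (1 - alpha - (1 - alpha)),
    so a uniform brightness scaling of the image cancels out.
    """
    rgb = np.asarray(rgb, dtype=np.float64)
    rgb = np.clip(rgb, eps, None)  # avoid log(0) on black pixels
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    return 0.5 + np.log(g) - alpha * np.log(b) - (1.0 - alpha) * np.log(r)


# Synthetic example: the same scene at full and at half brightness.
rng = np.random.default_rng(0)
image = rng.uniform(0.2, 1.0, size=(4, 4, 3))
darker = 0.5 * image

out_bright = illumination_invariant(image)
out_dark = illumination_invariant(darker)
```

In this sketch, `out_bright` and `out_dark` agree up to floating-point error, which is the sense in which such a transform weakens the effect of brightness changes before the images reach the segmentation network. Real illumination changes also shift color temperature, which is what the choice of `alpha` is meant to compensate for on a given camera.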


Pages: 9
