Semantic scene understanding on mobile device with illumination invariance for the visually impaired

Cited by: 0
Authors
Xu, Chengyou [1]
Wang, Kaiwei [1]
Yang, Kailun [1]
Cheng, Ruiqi [1]
Bai, Jian [1]
Affiliation
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
Source
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019 / Vol. 11169
Keywords
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device
DOI
10.1117/12.2532550
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visually Impaired People (VIP) find it very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to environment and illumination changes, including changes between indoor and outdoor environments and across different weather conditions. Meanwhile, most existing methods have paid more attention to either accuracy or efficiency than to the balance between the two. In the proposed system, the training dataset is preprocessed with an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. As for the network structure, lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve accuracy with little increase in computation, which makes the system suitable for mobile assistive devices. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset and demonstrate the strongly positive effect of the illumination-invariant pre-transformation in challenging real-world domains. The network, trained on a computer, achieves relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system is up to 83 FPS on a 1080Ti GPU.
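The abstract does not say which illumination-invariant transformation is applied to the training images. As an illustration only, the sketch below assumes one commonly used single-channel log-chromaticity form, I = 0.5 + log(G) - alpha * log(B) - (1 - alpha) * log(R). The function name `illumination_invariant`, the default `alpha = 0.48`, and the `eps` clamp are assumptions made for this example and are not taken from the paper. Because the log coefficients sum to zero, scaling all three channels by the same brightness factor leaves the output unchanged, which is the property the demo checks.

```python
import numpy as np


def illumination_invariant(rgb, alpha=0.48, eps=1e-6):
    """Map an (H, W, 3) RGB image with values in (0, 1] to one channel.

    Hypothetical helper: the formula and the alpha default are assumptions
    for illustration, not the transformation used in the paper.
    """
    rgb = np.asarray(rgb, dtype=np.float64)
    # Clamp to a small positive value so that log() is always defined.
    r = np.maximum(rgb[..., 0], eps)
    g = np.maximum(rgb[..., 1], eps)
    b = np.maximum(rgb[..., 2], eps)
    # The coefficients 1, -alpha and -(1 - alpha) sum to zero, so a global
    # intensity factor applied to all channels cancels out.
    return 0.5 + np.log(g) - alpha * np.log(b) - (1.0 - alpha) * np.log(r)


# Synthetic image and a uniformly dimmed copy of it.
rng = np.random.default_rng(0)
image = rng.uniform(0.2, 1.0, size=(4, 4, 3))
dimmed = 0.5 * image

out_a = illumination_invariant(image)
out_b = illumination_invariant(dimmed)
print(np.allclose(out_a, out_b))  # True: the global brightness change cancels
```

In a training pipeline, a transform of this kind would be applied to every image before it is fed to the segmentation network. Real illumination changes also shift colour temperature, which is what the choice of alpha is meant to compensate for, so the uniform-dimming check above covers only the simplest case.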
Pages: 9