Semantic scene understanding on mobile device with illumination invariance for the visually impaired

Authors
Xu, Chengyou [1]
Wang, Kaiwei [1]
Yang, Kailun [1]
Cheng, Ruiqi [1]
Bai, Jian [1]
Affiliation
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
Source
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019, Vol. 11169
Keywords
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;
DOI
10.1117/12.2532550
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
For Visually Impaired People (VIP), perceiving their surroundings is very difficult. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to changes in environment and illumination, including the transition between indoor and outdoor environments and changes across weather conditions. Meanwhile, most existing methods focus on either accuracy or efficiency rather than on a balance between the two. In the proposed system, the training dataset is preprocessed with an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. As for the network structure, lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve accuracy with little increase in computation, which makes the system suitable for mobile assistive devices. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset and demonstrate the extremely positive effect of the illumination-invariant pre-transformation in challenging real-world domains. The network, trained on a computer, achieves relatively high accuracy on ADE20K relabeled into 20 classes. The proposed system runs at up to 83 FPS on a 1080Ti GPU.
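The abstract does not say which illumination-invariant transformation is used. As an illustration only, the sketch below applies one common log-chromaticity formulation that maps an RGB pixel to a single channel. The function names, the value `alpha=0.48` (which in practice depends on the camera sensor) and the clamping constant `eps` are assumptions, not details taken from the paper.

```python
import math


def illumination_invariant(r, g, b, alpha=0.48, eps=1e-6):
    """Map one RGB pixel (channels in (0, 1]) to a single invariant value.

    Assumed formulation: 0.5 + log(G) - alpha*log(B) - (1 - alpha)*log(R).
    The log weights sum to zero, so multiplying all three channels by the
    same brightness factor leaves the result unchanged.
    """
    # Clamp to avoid log(0) on fully dark channels (eps is an assumption).
    r, g, b = (max(c, eps) for c in (r, g, b))
    return 0.5 + math.log(g) - alpha * math.log(b) - (1.0 - alpha) * math.log(r)


def transform_image(image, alpha=0.48):
    """Apply the transform to an image given as rows of (r, g, b) tuples."""
    return [[illumination_invariant(*px, alpha=alpha) for px in row]
            for row in image]


if __name__ == "__main__":
    bright = (0.2, 0.4, 0.6)
    dim = (0.1, 0.2, 0.3)  # same surface colour at half the brightness
    print(illumination_invariant(*bright), illumination_invariant(*dim))
```

Both printed values agree up to floating-point rounding, which is the property a segmentation network can exploit when the same scene is seen under brighter or dimmer light. A real pipeline would apply the transform to whole image arrays before training; the per-pixel version here only keeps the sketch free of dependencies.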
Pages: 9