Semantic scene understanding on mobile device with illumination invariance for the visually impaired

被引:0
|
作者
Xu, Chengyou [1 ]
Wang, Kaiwei [1 ]
Yang, Kailun [1 ]
Cheng, Ruiqi [1 ]
Bai, Jian [1 ]
机构
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
来源
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019年 / 11169卷
关键词
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;
D O I
10.1117/12.2532550
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Visually Impaired People (VIP), it's very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to the environment and illumination changes, including the change between indoor and outdoor environments and the change across different weather conditions. Meanwhile, most existing methods have paid more attention on either the accuracy or the efficiency, instead of the balance between both of them. In the proposed system, the training dataset is preprocessed by using an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. Regarding the structure of semantic segmentation network, the lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve the accuracy with little increasing of computation, which is suitable for mobile assistance device. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset, and demonstrate the extremely positive effect of the illumination-invariant pre-transformation in challenging real-world domain. The network trained on computer achieves a relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system is up to 83 FPS on a 1080Ti GPU.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Mobile Mapping and Visualization of Indoor Structures to Simplify Scene Understanding and Location Awareness
    Pintore, Giovanni
    Ganovelli, Fabio
    Gobbetti, Enrico
    Scopigno, Roberto
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 130 - 145
  • [32] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
    Zhai, Hongjia
    Huang, Gan
    Hu, Qirui
    Li, Guanglin
    Bao, Hujun
    Zhang, Guofeng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
  • [33] Improving RGB-Thermal Semantic Scene Understanding With Synthetic Data Augmentation for Autonomous Driving
    Li, Haotian
    Chu, Henry K.
    Sun, Yuxiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4452 - 4459
  • [34] TSS-Net: Time-based Semantic Segmentation Neural Network for Road Scene Understanding
    Duong, Tin Trung
    Nguyen, Huy-Hung
    Jeon, Jae Wook
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [35] GEOMETRIC INVARIANTS CONSTRUCTION FOR SEMANTIC SCENE UNDERSTANDING FROM MULTIPLE VIEWS INSPIRED BY THE HUMAN VISUAL SYSTEM
    Fan, N.
    Jin, Cheng
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2012, 12 (02)
  • [36] Scene Understanding and Semantic Mapping for Unmanned Ground Vehicles Using 3D Point Clouds
    Yan, Fei
    He, Guojian
    Zhuang, Yan
    Chang, Huan
    2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 341 - 347
  • [37] Scene understanding using natural language description based on 3D semantic graph map
    Jiyoun Moon
    Beomhee Lee
    Intelligent Service Robotics, 2018, 11 : 347 - 354
  • [38] Enhancing semantic segmentation for autonomous vehicle scene understanding in indian context using modified CANet model
    Khairnar, Smita
    Thepade, Sudeep D.
    Kolekar, Suresh
    Gite, Shilpa
    Pradhan, Biswajeet
    Alamri, Abdullah
    Patil, Bhagyesha
    Dahake, Shrutee
    Gaikwad, Radhika
    Chaudhari, Atharva
    METHODSX, 2025, 14
  • [39] Scene understanding using natural language description based on 3D semantic graph map
    Moon, Jiyoun
    Lee, Beomhee
    INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
  • [40] Multi-source pseudo-label learning of semantic segmentation for the scene recognition of agricultural mobile robots
    Matsuzaki, Shigemichi
    Miura, Jun
    Masuzawa, Hiroaki
    ADVANCED ROBOTICS, 2022, 36 (19) : 1011 - 1029