FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

Cited by: 130
Authors
Sun, Yuxiang [1]
Zuo, Weixun [1]
Yun, Peng [2]
Wang, Hengli [1]
Liu, Ming [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Image segmentation; Cameras; Lighting; Laser radar; Data integration; Autonomous driving; information fusion; semantic segmentation; thermal images; urban scenes; DYNAMIC ENVIRONMENTS; MOTION REMOVAL; D SLAM; POINT; NETWORK;
DOI
10.1109/TASE.2020.2993143
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Semantic segmentation of urban scenes is an essential component in various applications of autonomous driving, and it has made great progress with the rise of deep learning technologies. Most current semantic segmentation networks use single-modal sensory data, usually the RGB images produced by visible cameras. However, the segmentation performance of these networks is prone to degradation under unsatisfactory lighting conditions, such as dim light or darkness. We find that thermal images produced by thermal imaging cameras are robust to challenging lighting conditions. Therefore, in this article, we propose a novel RGB and thermal data fusion network named FuseSeg to achieve superior performance of semantic segmentation in urban scenes. The experimental results demonstrate that our network outperforms the state-of-the-art networks. Note to Practitioners: This article investigates the problem of semantic segmentation of urban scenes under unsatisfactory lighting conditions. We provide a solution to this problem via information fusion with RGB and thermal data. We build an end-to-end deep neural network, which takes as input a pair of RGB and thermal images and outputs pixel-wise semantic labels. Our network could be used for urban scene understanding, which serves as a fundamental component of many autonomous driving tasks, such as environment modeling, obstacle avoidance, motion prediction, and planning. Moreover, the simple design of our network allows it to be easily implemented using various deep learning frameworks, which facilitates its application on different hardware or software platforms.
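The input/output contract described in the abstract (a pair of RGB and thermal images in, one semantic label per pixel out) can be illustrated with a small sketch. This is not the FuseSeg architecture, which the abstract does not detail. It assumes, purely for illustration, that fusion is done by appending the thermal channel to the RGB channels and that a per-pixel linear classifier stands in for the deep network. The names `fuse_rgb_thermal` and `segment`, and the weights used below, are hypothetical.

```python
from typing import List, Sequence

# Images are nested lists: image[row][col] is a pixel.
# An RGB pixel is a sequence of 3 floats; a thermal pixel is a single float.


def fuse_rgb_thermal(rgb: List[List[Sequence[float]]],
                     thermal: List[List[float]]) -> List[List[List[float]]]:
    """Append the thermal value to each RGB pixel, giving an H x W x 4 image.

    Assumption: channel concatenation is a stand-in for the paper's fusion.
    """
    same_height = len(rgb) == len(thermal)
    same_width = all(len(r) == len(t) for r, t in zip(rgb, thermal))
    if not (same_height and same_width):
        raise ValueError("RGB and thermal images must have the same size")
    return [
        [list(pixel) + [temp] for pixel, temp in zip(rgb_row, thermal_row)]
        for rgb_row, thermal_row in zip(rgb, thermal)
    ]


def segment(fused: List[List[List[float]]],
            weights: List[List[float]],
            biases: List[float]) -> List[List[int]]:
    """Label each pixel with the class whose linear score is highest.

    Assumption: a per-pixel linear classifier replaces the deep network.
    """
    labels = []
    for row in fused:
        label_row = []
        for pixel in row:
            scores = [
                sum(w * x for w, x in zip(class_weights, pixel)) + bias
                for class_weights, bias in zip(weights, biases)
            ]
            label_row.append(max(range(len(scores)), key=scores.__getitem__))
        labels.append(label_row)
    return labels


# A fully dark 2 x 2 RGB image: the visible channels carry no information.
dark_rgb = [[(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)],
            [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)]]
# The thermal image still separates warm pixels from cool ones.
thermal = [[0.9, 0.1],
           [0.2, 0.8]]

# Hypothetical weights: class 0 (background) scores a constant 0.5,
# class 1 (warm object) scores the thermal value.
weights = [[0.0, 0.0, 0.0, 0.0],
           [0.0, 0.0, 0.0, 1.0]]
biases = [0.5, 0.0]

labels = segment(fuse_rgb_thermal(dark_rgb, thermal), weights, biases)
print(labels)
```

In this toy case the RGB channels are all zero, so the labels are determined entirely by the thermal channel, which mirrors the abstract's motivation for adding thermal data under poor lighting.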
Pages: 1000-1011
Page count: 12
Related papers
50 in total
  • [21] TSTR: A Real-Time RGB-Thermal Semantic Segmentation Model with Multimodal Fusion Transformers
    Zhao, Guogiang
    Yan, Xiaoyun
    Cui, Aodie
    Hu, Chang
    Bao, Jiaqi
    Huang, Junjie
    2023 19TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN 2023, 2023, : 588 - 595
  • [22] Real-Time Flame Segmentation based on RGB-Thermal Fusion
    Guo, Shuaihao
    Hu, Biao
    Huang, Ran
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1435 - 1440
  • [23] CGFNet: cross-guided fusion network for RGB-thermal semantic segmentation
    Yanping Fu
    Qiaoqiao Chen
    Haifeng Zhao
    The Visual Computer, 2022, 38 : 3243 - 3252
  • [24] MEFNET: Multi-expert fusion network for RGB-Thermal semantic segmentation
    Lai, Wenjie
    Zeng, Fanyu
    Hu, Xiao
    Li, Wei
    He, Shaowei
    Liu, Ziji
    Jiang, Yadong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [25] SFAF-MA: Spatial Feature Aggregation and Fusion With Modality Adaptation for RGB-Thermal Semantic Segmentation
    He, Xunjie
    Wang, Meiling
    Liu, Tong
    Zhao, Lin
    Yue, Yufeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [26] ScaleNet: Scale Invariant Network for Semantic Segmentation in Urban Driving Scenes
    Ansari, Mohammad Dawud
    Krauss, Stephan
    Wasenmueller, Oliver
    Stricker, Didier
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 399 - 404
  • [27] Semantic Segmentation of Urban Scenes with Enhanced Spatial Contexts
    Wang, Jeonghyeon
    Kim, Jinwhan
    2016 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2016, : 263 - 266
  • [28] Semantic Segmentation of Urban Scenes Using Spatial Contexts
    Wang, Jeonghyeon
    Kim, Jinwhan
    IEEE ACCESS, 2020, 8 : 55254 - 55268
  • [29] AGFNet: Adaptive Gated Fusion Network for RGB-T Semantic Segmentation
    Zhou, Xiaofei
    Wu, Xiaoling
    Bao, Liuxin
    Yin, Haibing
    Jiang, Qiuping
    Zhang, Jiyong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [30] A Multisensor Data Fusion Model for Semantic Segmentation in Aerial Images
    Weng, Qian
    Chen, Hao
    Chen, Hongli
    Guo, Wenzhong
    Mao, Zhengyuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19