FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

Cited by: 130
Authors
Sun, Yuxiang [1]
Zuo, Weixun [1]
Yun, Peng [2]
Wang, Hengli [1]
Liu, Ming [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semantics; Image segmentation; Cameras; Lighting; Laser radar; Data integration; Autonomous driving; information fusion; semantic segmentation; thermal images; urban scenes; DYNAMIC ENVIRONMENTS; MOTION REMOVAL; D SLAM; POINT; NETWORK;
DOI
10.1109/TASE.2020.2993143
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic segmentation of urban scenes is an essential component in various applications of autonomous driving. It has made great progress with the rise of deep learning technologies. Most current semantic segmentation networks use single-modal sensory data, usually the RGB images produced by visible cameras. However, the segmentation performance of these networks is prone to degrade when lighting conditions are unsatisfactory, such as in dim light or darkness. We find that thermal images produced by thermal imaging cameras are robust to challenging lighting conditions. Therefore, in this article, we propose a novel RGB and thermal data fusion network named FuseSeg to achieve superior performance of semantic segmentation in urban scenes. The experimental results demonstrate that our network outperforms the state-of-the-art networks.

Note to Practitioners: This article investigates the problem of semantic segmentation of urban scenes when lighting conditions are unsatisfactory. We provide a solution to this problem via information fusion with RGB and thermal data. We build an end-to-end deep neural network, which takes as input a pair of RGB and thermal images and outputs pixel-wise semantic labels. Our network could be used for urban scene understanding, which serves as a fundamental component of many autonomous driving tasks, such as environment modeling, obstacle avoidance, motion prediction, and planning. Moreover, the simple design of our network allows it to be easily implemented using various deep learning frameworks, which facilitates its application on different hardware or software platforms.
Pages: 1000-1011
Number of pages: 12
Related papers
50 in total
  • [1] RTFNet: RGB-Thermal Fusion Network for Semantic Segmentation of Urban Scenes
    Sun, Yuxiang
    Zuo, Weixun
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (3): 2576-2583
  • [2] Light Transport Induced Domain Adaptation for Semantic Segmentation in Thermal Infrared Urban Scenes
    Chen, Junzhang
    Liu, Zichao
    Jin, Darui
    Wang, Yuanyuan
    Yang, Fan
    Bai, Xiangzhi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12): 23194-23211
  • [3] Multispectral Fusion Transformer Network for RGB-Thermal Urban Scene Semantic Segmentation
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Huo, Qizheng
    Xie, Yongqiang
    Li, Zhongbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [4] Robust semantic segmentation based on RGB-thermal in variable lighting scenes
    Guo, Zhifeng
    Li, Xu
    Xu, Qimin
    Sun, Zhengliang
    MEASUREMENT, 2021, 186
  • [5] IAFFNet: Illumination-Aware Feature Fusion Network for All-Day RGB-Thermal Semantic Segmentation of Road Scenes
    Hou, Ya-Li
    Jia, Yan
    Hou, Zhijiang
    Hao, Xiaoli
    Shen, Yan
    IEEE ACCESS, 2022, 10: 129702-129711
  • [6] UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes
    Sheng, Hao
    Cong, Ruixuan
    Yang, Da
    Chen, Rongshan
    Wang, Sizhe
    Cui, Zhenglong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11): 7880-7893
  • [7] A Curriculum Domain Adaptation Approach to the Semantic Segmentation of Urban Scenes
    Zhang, Yang
    David, Philip
    Foroosh, Hassan
    Gong, Boqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (8): 1823-1841
  • [8] Contrastive learning-based knowledge distillation for RGB-thermal urban scene semantic segmentation
    Guo, Xiaodong
    Zhou, Wujie
    Liu, Tong
    KNOWLEDGE-BASED SYSTEMS, 2024, 292
  • [9] Residual spatial fusion network for RGB-thermal semantic segmentation
    Li, Ping
    Chen, Junjie
    Lin, Binbin
    Xu, Xianghua
    NEUROCOMPUTING, 2024, 595
  • [10] CGFNet: cross-guided fusion network for RGB-thermal semantic segmentation
    Fu, Yanping
    Chen, Qiaoqiao
    Zhao, Haifeng
    VISUAL COMPUTER, 2022, 38 (9-10): 3243-3252