FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

Cited by: 130
Authors
Sun, Yuxiang [1 ]
Zuo, Weixun [1 ]
Yun, Peng [2 ]
Wang, Hengli [1 ]
Liu, Ming [1 ]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Image segmentation; Cameras; Lighting; Laser radar; Data integration; Autonomous driving; information fusion; semantic segmentation; thermal images; urban scenes; DYNAMIC ENVIRONMENTS; MOTION REMOVAL; D SLAM; POINT; NETWORK;
DOI
10.1109/TASE.2020.2993143
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Semantic segmentation of urban scenes is an essential component in various applications of autonomous driving. It makes great progress with the rise of deep learning technologies. Most of the current semantic segmentation networks use single-modal sensory data, which are usually the RGB images produced by visible cameras. However, the segmentation performance of these networks is prone to be degraded when lighting conditions are not satisfied, such as dim light or darkness. We find that thermal images produced by thermal imaging cameras are robust to challenging lighting conditions. Therefore, in this article, we propose a novel RGB and thermal data fusion network named FuseSeg to achieve superior performance of semantic segmentation in urban scenes. The experimental results demonstrate that our network outperforms the state-of-the-art networks.
Note to Practitioners: This article investigates the problem of semantic segmentation of urban scenes when lighting conditions are not satisfied. We provide a solution to this problem via information fusion with RGB and thermal data. We build an end-to-end deep neural network, which takes as input a pair of RGB and thermal images and outputs pixel-wise semantic labels. Our network could be used for urban scene understanding, which serves as a fundamental component of many autonomous driving tasks, such as environment modeling, obstacle avoidance, motion prediction, and planning. Moreover, the simple design of our network allows it to be easily implemented using various deep learning frameworks, which facilitates the applications on different hardware or software platforms.
Pages: 1000-1011
Page count: 12
Related papers
50 in total
  • [41] GMNet: Graded-Feature Multilabel-Learning Network for RGB-Thermal Urban Scene Semantic Segmentation
    Zhou, Wujie
    Liu, Jinfu
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7790 - 7802
  • [42] Data Fusion and Models Integration for Enhanced Semantic Segmentation in Remote Sensing
    Dong, Xiaorui
    Li, Jiansheng
    Chang, Qingfang
    Miao, Shufeng
    Wan, Hongxiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7134 - 7151
  • [43] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
    Laupheimer, Dominik
    Haala, Norbert
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
  • [44] CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation With Transformers
    Zhang, Jiaming
    Liu, Huayao
    Yang, Kailun
    Hu, Xinxin
    Liu, Ruiping
    Stiefelhagen, Rainer
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14679 - 14694
  • [45] STN: Saliency-Guided Transformer Network for Point-Wise Semantic Segmentation of Urban Scenes
    Ma, Lingfei
    Li, Jonathan
    Guan, Haiyan
    Yu, Yongtao
    Chen, Yiping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [46] Multimodal Frequeny Spectrum Fusion Schema for RGB-T Image Semantic Segmentation
    Liu, Hengyan
    Zhang, Wenzhang
    Dai, Tianhong
    Yin, Longfei
    Ren, Guangyu
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [47] Robust semantic segmentation method of urban scenes in snowy environment
    Yin, Hanqi
    Yin, Guisheng
    Sun, Yiming
    Zhang, Liguo
    Tian, Ye
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [48] Multi-modal neural networks with multi-scale RGB-T fusion for semantic segmentation
    Lyu, Y.
    Schiopu, I.
    Munteanu, A.
    ELECTRONICS LETTERS, 2020, 56 (18) : 920 - 922
  • [49] Unsupervised Domain Extension for Nighttime Semantic Segmentation in Urban Scenes
    Scherer, Sebastian
    Schoen, Robin
    Ludwig, Katja
    Lienhart, Rainer
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2021, : 38 - 47
  • [50] Knowledge Distillation SegFormer-Based Network for RGB-T Semantic Segmentation
    Zhou, Wujie
    Gong, Tingting
    Yan, Weiqing
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03) : 2170 - 2182