Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding

被引:4
|
作者
Li, Haotian [1 ]
Chu, Henry K. [1 ]
Sun, Yuxiang [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期
关键词
Semantic segmentation; Accuracy; Measurement; Semantics; Optical flow; Cameras; Image synthesis; Autonomous vehicles; multi-modal fusion; RGB-Thermal; semantic segmentation; temporal consistency;
D O I
10.1109/LRA.2024.3458594
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods.
引用
收藏
页码:9757 / 9764
页数:8
相关论文
共 49 条
  • [1] Improving RGB-Thermal Semantic Scene Understanding With Synthetic Data Augmentation for Autonomous Driving
    Li, Haotian
    Chu, Henry K.
    Sun, Yuxiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4452 - 4459
  • [2] Multispectral Fusion Transformer Network for RGB-Thermal Urban Scene Semantic Segmentation
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Huo, Qizheng
    Xie, Yongqiang
    Li, Zhongbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [3] Global contextually guided lightweight network for RGB-thermal urban scene understanding
    Gong, Tingting
    Zhou, Wujie
    Qian, Xiaohong
    Lei, Jingsheng
    Yu, Lu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [4] Robust semantic segmentation based on RGB-thermal in variable lighting scenes
    Guo, Zhifeng
    Li, Xu
    Xu, Qimin
    Sun, Zhengliang
    MEASUREMENT, 2021, 186
  • [5] RegSeg: An End-to-End Network for Multimodal RGB-Thermal Registration and Semantic Segmentation
    Lai, Wenjie
    Zeng, Fanyu
    Hu, Xiao
    He, Shaowei
    Liu, Ziji
    Jiang, Yadong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6676 - 6690
  • [6] Dual-Space Graph-Based Interaction Network for RGB-Thermal Semantic Segmentation in Electric Power Scene
    Xu, Chang
    Li, Qingwu
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1577 - 1592
  • [7] Embedded Control Gate Fusion and Attention Residual Learning for RGB-Thermal Urban Scene Parsing
    Zhou, Wujie
    Lv, Ying
    Lei, Jingsheng
    Yu, Lu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 4794 - 4803
  • [8] CGFNet: cross-guided fusion network for RGB-thermal semantic segmentation
    Yanping Fu
    Qiaoqiao Chen
    Haifeng Zhao
    The Visual Computer, 2022, 38 : 3243 - 3252
  • [9] Residual spatial fusion network for RGB-thermal semantic segmentation
    Li, Ping
    Chen, Junjie
    Lin, Binbin
    Xu, Xianghua
    NEUROCOMPUTING, 2024, 595
  • [10] MEFNET: Multi-expert fusion network for RGB-Thermal semantic segmentation
    Lai, Wenjie
    Zeng, Fanyu
    Hu, Xiao
    Li, Wei
    He, Shaowei
    Liu, Ziji
    Jiang, Yadong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125