Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding

被引:4
|
作者
Li, Haotian [1 ]
Chu, Henry K. [1 ]
Sun, Yuxiang [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期
关键词
Semantic segmentation; Accuracy; Measurement; Semantics; Optical flow; Cameras; Image synthesis; Autonomous vehicles; multi-modal fusion; RGB-Thermal; semantic segmentation; temporal consistency;
D O I
10.1109/LRA.2024.3458594
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods.
引用
收藏
页码:9757 / 9764
页数:8
相关论文
共 50 条
  • [31] Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection
    Huo, Fushuo
    Zhu, Xuegui
    Zhang, Qian
    Liu, Ziming
    Yu, Wenchao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [32] Semantic Segmentation of Indoor-Scene RGB-D Images Based on Iterative Contraction and Merging
    Syu, Jia-Hao
    Cho, Shih-Hsuan
    Wang, Sheng-Jyh
    Wang, Li-Chun
    IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 252 - 261
  • [33] Multistage Shallow Pyramid Parsing for Road Scene Understanding Based on Semantic Segmentation
    Nurhadiyatna, Adi
    Loncaric, Sven
    PROCEEDINGS OF THE 2019 11TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2019), 2019, : 198 - 203
  • [34] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
    Zhai, Hongjia
    Huang, Gan
    Hu, Qirui
    Li, Guanglin
    Bao, Hujun
    Zhang, Guofeng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
  • [35] Enhanced Scene Understanding and Situation Awareness for Autonomous Vehicles Based on Semantic Segmentation
    Zhao, Yiyue
    Wang, Liang
    Yun, Xinyu
    Chai, Chen
    Liu, Zhiyu
    Fan, Wenxuan
    Luo, Xiao
    Liu, Yang
    Qu, Xiaobo
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, : 6537 - 6549
  • [36] CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images
    Feng, Zhen
    Guo, Yanning
    Sun, Yuxiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 2205 - 2212
  • [37] Anisotropic Convolutional Neural Networks for RGB-D Based Semantic Scene Completion
    Li, Jie
    Wang, Peng
    Han, Kai
    Liu, Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8125 - 8138
  • [38] High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal difference
    Wang, Qishun
    Tu, Zhengzheng
    Li, Chenglong
    Tang, Jin
    INFORMATION FUSION, 2025, 114
  • [39] RGB-DI Images and Full Convolution Neural Network-Based Outdoor Scene Understanding for Mobile Robots
    Qiu, Zengshuai
    Zhuang, Yan
    Yan, Fei
    Hu, Huosheng
    Wang, Wei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2019, 68 (01) : 27 - 37
  • [40] TSS-Net: Time-based Semantic Segmentation Neural Network for Road Scene Understanding
    Duong, Tin Trung
    Nguyen, Huy-Hung
    Jeon, Jae Wook
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,