Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding

被引：4

作者：

Li, Haotian ^{[1
]}

Chu, Henry K. ^{[1
]}

Sun, Yuxiang ^{[2
]}

机构：

[1] Hong Kong Polytech Univ, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期

关键词：

Semantic segmentation; Accuracy; Measurement; Semantics; Optical flow; Cameras; Image synthesis; Autonomous vehicles; multi-modal fusion; RGB-Thermal; semantic segmentation; temporal consistency;

D O I：

10.1109/LRA.2024.3458594

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods.

引用

页码：9757 / 9764

页数：8

共 50 条

[31] Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection
Huo, Fushuo
Zhu, Xuegui
Zhang, Qian
Liu, Ziming
Yu, Wenchao
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[32] Semantic Segmentation of Indoor-Scene RGB-D Images Based on Iterative Contraction and Merging
Syu, Jia-Hao
Cho, Shih-Hsuan
Wang, Sheng-Jyh
Wang, Li-Chun
IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 252 - 261
[33] Multistage Shallow Pyramid Parsing for Road Scene Understanding Based on Semantic Segmentation
Nurhadiyatna, Adi
Loncaric, Sven
PROCEEDINGS OF THE 2019 11TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2019), 2019, : 198 - 203
[34] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
Zhai, Hongjia
Huang, Gan
Hu, Qirui
Li, Guanglin
Bao, Hujun
Zhang, Guofeng
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
[35] Enhanced Scene Understanding and Situation Awareness for Autonomous Vehicles Based on Semantic Segmentation
Zhao, Yiyue
Wang, Liang
Yun, Xinyu
Chai, Chen
Liu, Zhiyu
Fan, Wenxuan
Luo, Xiao
Liu, Yang
Qu, Xiaobo
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, : 6537 - 6549
[36] CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images
Feng, Zhen
Guo, Yanning
Sun, Yuxiang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 2205 - 2212
[37] Anisotropic Convolutional Neural Networks for RGB-D Based Semantic Scene Completion
Li, Jie
Wang, Peng
Han, Kai
Liu, Yu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8125 - 8138
[38] High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal difference
Wang, Qishun
Tu, Zhengzheng
Li, Chenglong
Tang, Jin
INFORMATION FUSION, 2025, 114
[39] RGB-DI Images and Full Convolution Neural Network-Based Outdoor Scene Understanding for Mobile Robots
Qiu, Zengshuai
Zhuang, Yan
Yan, Fei
Hu, Huosheng
Wang, Wei
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2019, 68 (01) : 27 - 37
[40] TSS-Net: Time-based Semantic Segmentation Neural Network for Road Scene Understanding
Duong, Tin Trung
Nguyen, Huy-Hung
Jeon, Jae Wook
PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,

← 1 2 3 4 5 →