Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding

被引:4
|
作者
Li, Haotian [1 ]
Chu, Henry K. [1 ]
Sun, Yuxiang [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期
关键词
Semantic segmentation; Accuracy; Measurement; Semantics; Optical flow; Cameras; Image synthesis; Autonomous vehicles; multi-modal fusion; RGB-Thermal; semantic segmentation; temporal consistency;
D O I
10.1109/LRA.2024.3458594
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods.
引用
收藏
页码:9757 / 9764
页数:8
相关论文
共 50 条
  • [21] HEFANet: hierarchical efficient fusion and aggregation segmentation network for enhanced rgb-thermal urban scene parsing
    Shen, Zhengwen
    Pan, Zaiyu
    Weng, Yuchen
    Li, Yulian
    Wang, Jiangyu
    Wang, Jun
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11248 - 11266
  • [22] Semantic Foggy Scene Understanding with Synthetic Data
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) : 973 - 992
  • [23] Semantic Foggy Scene Understanding with Synthetic Data
    Christos Sakaridis
    Dengxin Dai
    Luc Van Gool
    International Journal of Computer Vision, 2018, 126 : 973 - 992
  • [24] Real-Time Flame Segmentation based on RGB-Thermal Fusion
    Guo, Shuaihao
    Hu, Biao
    Huang, Ran
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1435 - 1440
  • [25] ECFNet: Efficient cross-layer fusion network for real time RGB-Thermal urban scene parsing
    Shen, Zhengwen
    Wang, Jiangyu
    Weng, Yuchen
    Pan, Zaiyu
    Li, Yulian
    Wang, Jun
    DIGITAL SIGNAL PROCESSING, 2024, 151
  • [26] EGFNet: Edge-Aware Guidance Fusion Network for RGB-Thermal Urban Scene Parsing
    Dong, Shaohua
    Zhou, Wujie
    Xu, Caie
    Yan, Weiqing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (01) : 657 - 669
  • [27] Probability-Based Framework to Fuse Temporal Consistency and Semantic Information for Background Segmentation
    Zeng, Zhi
    Wang, Ting
    Ma, Fulei
    Zhang, Liang
    Shen, Peiyi
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 740 - 754
  • [28] BFTNet: Boundary-Induced Four-Phase Transformer Network for RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Gong, Tingting
    Fang, Meixin
    Yu, Lu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [29] IAFFNet: Illumination-Aware Feature Fusion Network for All-Day RGB-Thermal Semantic Segmentation of Road Scenes
    Hou, Ya-Li
    Jia, Yan
    Hou, Zhijiang
    Hao, Xiaoli
    Shen, Yan
    IEEE ACCESS, 2022, 10 : 129702 - 129711
  • [30] Vision-Based Semantic Segmentation in Scene Understanding for Autonomous Driving: Recent Achievements, Challenges, and Outlooks
    Muhammad, Khan
    Hussain, Tanveer
    Ullah, Hayat
    Del Ser, Javier
    Rezaei, Mahdi
    Kumar, Neeraj
    Hijji, Mohammad
    Bellavista, Paolo
    de Albuquerque, Victor Hugo C.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 22694 - 22715