Thermal infrared image semantic segmentation for night-time driving scenes based on deep learning

被引:4
作者
Maheswari, B. [1 ]
Reeja, S. R. [1 ]
机构
[1] VIT AP Univ, Sch Comp Sci & Engn, Amaravati 522237, Andhra Prades, India
关键词
Thermal infrared images; Semantic segmentation; Top-down guided attention module; Attention loss; Organized gradient alignment loss; Semantic encoding ambiguity; ATTENTION NETWORK; FUSION; RGB;
D O I
10.1007/s11042-023-15882-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation of thermal infrared (ThIR) images is challenging because the images considered in this task are highly complex. The discrimination of image regions is very difficult, and the traditional techniques fail to discover the crucial semantic information from the images completely. To overcome such issue, this paper introduces a novel network model for ThIR image semantic segmentation that facilitates effective image-to-image translation and reduces semantic encoding ambiguity. The proposed model is named top-down attention and gradient alignment-based graph neural network (AGAGNN). A top-down guided attention module (GAM) is utilized in the proposed model to deal with semantic encoding ambiguity. Apart from this, an elaborate attention loss is introduced to ensure a hierarchical coding of features. Also, the edge distortion problem due to the translation of images is reduced with an organized gradient alignment loss. The proposed model is evaluated under the Python platform based on pixel-level annotations over the KAIST dataset. The proposed model has shown 98.3% accuracy, and the comparative analysis has proved that the model is more effective than the existing models in preserving semantic information.
引用
收藏
页码:44885 / 44910
页数:26
相关论文
共 53 条
  • [1] Abbadi N. K. E., 2020, INDONESIAN J ELECT E, V18, P1501, DOI DOI 10.11591/IJEECS.V18.I3.PP1501-1509
  • [2] Anoosheh A, 2019, IEEE INT CONF ROBOT, P5958, DOI [10.1109/icra.2019.8794387, 10.1109/ICRA.2019.8794387]
  • [3] A method for improving semantic segmentation using thermographic images in infants
    Asano, Hidetsugu
    Hirakawa, Eiji
    Hayashi, Hayato
    Hamada, Keisuke
    Asayama, Yuto
    Oohashi, Masaaki
    Uchiyama, Akira
    Higashino, Teruo
    [J]. BMC MEDICAL IMAGING, 2022, 22 (01)
  • [4] Balit E., 2020, IN P 16 EUROPEAN C C, P1
  • [5] Pedestrian detection with unsupervised multispectral feature learning using deep neural networks
    Cao, Yanpeng
    Guan, Dayan
    Huang, Weilin
    Yang, Jiangxin
    Cao, Yanlong
    Qiao, Yu
    [J]. INFORMATION FUSION, 2019, 46 : 206 - 217
  • [6] Choi KC, 2021, THERMAL IMAGE SEMANT
  • [7] FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation
    Deng, Fuqin
    Feng, Hua
    Liang, Mingjian
    Wang, Hongmin
    Yang, Yong
    Gao, Yuan
    Chen, Junfeng
    Hu, Junjie
    Guo, Xiyue
    Lam, Tin Lun
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4467 - 4473
  • [8] Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges
    Feng, Di
    Haase-Schutz, Christian
    Rosenbaum, Lars
    Hertlein, Heinz
    Glaser, Claudius
    Timm, Fabian
    Wiesbeck, Werner
    Dietmayer, Klaus
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) : 1341 - 1360
  • [9] A new representation of scene layout improves saliency detection in traffic scenes
    He, De-Huai
    Yang, Kai-Fu
    Wan, Xue-Mei
    Xiao, Fen
    Yan, Hong-Mei
    Li, Yong-Jie
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [10] Infrared machine vision and infrared thermography with deep learning: A review
    He, Yunze
    Deng, Baoyuan
    Wang, Hongjin
    Cheng, Liang
    Zhou, Ke
    Cai, Siyuan
    Ciampa, Francesco
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2021, 116