SmokeSeger: A Transformer-CNN coupled model for urban scene smoke segmentation

被引:12
|
作者
Jing, Tao [1 ]
Meng, Qing-Hao [1 ]
Hou, Hui-Rang [1 ]
机构
[1] Tianjin Univ, Inst Robot & Autonomous Syst, Sch Elect & Informat Engn, Tianjin Key Lab Proc Measurement & Control, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
smoke semantic segmentation; urban smoke scene; dual-branch encoder; transformer; convolutional neural network; NETWORK;
D O I
10.1109/TII.2023.3271441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smoke is an informative indicator of early fire and gas leakage. Segmenting the smoke from images can provide detailed information about the smoke volume, dispersion direction, and source location, which has significant implications considering the proliferation of video surveillance systems in cities. Focusing on smoke segmentation in the urban scene, we designed a dual-branch segmentation model, named SmokeSeger, which couples a Transformer branch and a CNN branch to enhance the representation of both global and local features. To address the lack of real-scene smoke datasets, we built an urban scene smoke segmentation dataset containing 3217 images of fire smoke and exhaust emissions with accurate annotations. Experiments validate that the SmokeSeger outperforms other mainstream segmentation methods on the proposed dataset. Visualization of attention maps reveals that the model could effectively capture the semantic relationship between the smoke and the corresponding source, which benefits the discrimination between smoke and smoke-like objects.
引用
收藏
页码:1385 / 1396
页数:12
相关论文
共 50 条
  • [41] Cracks segmentation of engineering structures in complex backgrounds using a concatenation of Transformer and CNN models driven by scene understanding information
    Zhang, Chun
    Yu, Jian
    Zhao, Yinjie
    Wu, Han
    Wu, Guangyu
    STRUCTURES, 2024, 65
  • [42] Emb-trattunet: a novel edge loss function and transformer-CNN architecture for multi-classes pneumonia infection segmentation in low annotation regimes
    Fares Bougourzi
    Fadi Dornaika
    Amir Nakib
    Abdelmalik Taleb-Ahmed
    Artificial Intelligence Review, 57
  • [43] Advanced Hybrid Transformer-CNN Deep Learning Model for Effective Intrusion Detection Systems with Class Imbalance Mitigation Using Resampling Techniques
    Kamal, Hesham
    Mashaly, Maggie
    FUTURE INTERNET, 2024, 16 (12)
  • [44] UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery
    Wang, Libo
    Li, Rui
    Zhang, Ce
    Fang, Shenghui
    Duan, Chenxi
    Meng, Xiaoliang
    Atkinson, Peter M.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 190 : 196 - 214
  • [45] MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation
    Ao, Yu
    Shi, Weili
    Ji, Bai
    Miao, Yu
    He, Wei
    Jiang, Zhengang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [46] Urban scene segmentation model based on multi-scale shuffle features
    Gu, Wenjuan
    Wang, Hongcheng
    Liu, Xiaobao
    Yin, Yanchao
    Xu, Biao
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 11763 - 11784
  • [47] Channel selection and local attention transformer model for semantic segmentation on UAV remote sensing scene
    Liu, Da
    Long, Hao
    Liu, Zhenbao
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [48] Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images
    Wang, Libo
    Li, Rui
    Wang, Dongzhi
    Duan, Chenxi
    Wang, Teng
    Meng, Xiaoliang
    REMOTE SENSING, 2021, 13 (16)
  • [49] A MULTI-SCALE SAR-OPTICAL IMAGE MATCHING METHOD USING STRUCTURE-ENHANCED CONVOLUTIONAL LAYER AND TRANSFORMER-CNN MODEL
    Liu, Yijun
    Long, Jie
    Liu, Qian
    Liu, Chang
    Wang, Qingsong
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2975 - 2978
  • [50] CardSegNet: An adaptive hybrid CNN-vision transformer model for heart region segmentation in cardiac MRI
    Aghapanah, Hamed
    Rasti, Reza
    Kermani, Saeed
    Tabesh, Faezeh
    Banaem, Hossein Yousefi
    Aliakbar, Hamidreza Pour
    Sanei, Hamid
    Segars, William Paul
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 115