SmokeSeger: A Transformer-CNN coupled model for urban scene smoke segmentation

被引:12
|
作者
Jing, Tao [1 ]
Meng, Qing-Hao [1 ]
Hou, Hui-Rang [1 ]
机构
[1] Tianjin Univ, Inst Robot & Autonomous Syst, Sch Elect & Informat Engn, Tianjin Key Lab Proc Measurement & Control, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
smoke semantic segmentation; urban smoke scene; dual-branch encoder; transformer; convolutional neural network; NETWORK;
D O I
10.1109/TII.2023.3271441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smoke is an informative indicator of early fire and gas leakage. Segmenting the smoke from images can provide detailed information about the smoke volume, dispersion direction, and source location, which has significant implications considering the proliferation of video surveillance systems in cities. Focusing on smoke segmentation in the urban scene, we designed a dual-branch segmentation model, named SmokeSeger, which couples a Transformer branch and a CNN branch to enhance the representation of both global and local features. To address the lack of real-scene smoke datasets, we built an urban scene smoke segmentation dataset containing 3217 images of fire smoke and exhaust emissions with accurate annotations. Experiments validate that the SmokeSeger outperforms other mainstream segmentation methods on the proposed dataset. Visualization of attention maps reveals that the model could effectively capture the semantic relationship between the smoke and the corresponding source, which benefits the discrimination between smoke and smoke-like objects.
引用
收藏
页码:1385 / 1396
页数:12
相关论文
共 50 条
  • [21] Anatomical Landmark Detection Using a Multiresolution Learning Approach with a Hybrid Transformer-CNN Model
    Viriyasaranon, Thanaporn
    Ma, Serie
    Choi, Jang-Hwan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 433 - 443
  • [22] A pyramid Gaussian pooling based CNN and transformer hybrid network for smoke segmentation
    Wang, Guiqian
    Yuan, Feiniu
    Li, Hongdi
    Fang, Zhijun
    IET IMAGE PROCESSING, 2024, 18 (12) : 3206 - 3217
  • [23] UAVformer: A Composite Transformer Network for Urban Scene Segmentation of UAV Images
    Yi, Shi
    Liu, Xi
    Li, Junjie
    Chen, Ling
    PATTERN RECOGNITION, 2023, 133
  • [24] A lightweight multi-scale multi-angle dynamic interactive transformer-CNN fusion model for 3D medical image segmentation
    Hua, Xin
    Du, Zhijiang
    Yu, Hongjian
    Ma, Jixin
    Zheng, Fanjun
    Zhang, Chen
    Lu, Qiaohui
    Zhao, Hui
    NEUROCOMPUTING, 2024, 608
  • [25] CCTNet: Coupled CNN and Transformer Network for Crop Segmentation of Remote Sensing Images
    Wang, Hong
    Chen, Xianzhong
    Zhang, Tianxiang
    Xu, Zhiyong
    Li, Jiangyun
    REMOTE SENSING, 2022, 14 (09)
  • [26] U-TransCNN: A U-shape transformer-CNN fusion model for underwater image enhancement☆
    Yao, Haiyang
    Guo, Ruige
    Zhao, Zhongda
    Zang, Yuzhang
    Zhao, Xiaobo
    Lei, Tao
    Wang, Haiyan
    DISPLAYS, 2025, 88
  • [27] CTMFNet: CNN and Transformer Multiscale Fusion Network of Remote Sensing Urban Scene Imagery
    Song, Pengfei
    Li, Jinjiang
    An, Zhiyong
    Fan, Hui
    Fan, Linwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [28] Data-Oriented Octree Inverse Hierarchical Order Aggregation Hybrid Transformer-CNN for 3D Medical Segmentation
    Li, Yuhua
    Jiang, Shan
    Yang, Zhiyong
    Wang, Lixiang
    Wang, Liwen
    Zhou, Zeyang
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,
  • [29] Enhancing Lung Tumor Segmentation: A Comparative Study of CNN-based Network with Multi-Scale Strategies and Attention Mechanisms and Hybrid Transformer-CNN Network
    Kim, Hye Ryun
    Lee, Jumin
    Hong, Helen
    Kim, Bong-Seog
    COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024, 2024, 12927
  • [30] Dual-Attention Model Fusing CNN and Transformer for Pancreas Segmentation
    Zhu, Yan
    Hu, Peijun
    Tian, Yu
    Dong, Kaiqi
    Li, Jingsong
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 931 - 935