SmokeSeger: A Transformer-CNN coupled model for urban scene smoke segmentation

被引:12
|
作者
Jing, Tao [1 ]
Meng, Qing-Hao [1 ]
Hou, Hui-Rang [1 ]
机构
[1] Tianjin Univ, Inst Robot & Autonomous Syst, Sch Elect & Informat Engn, Tianjin Key Lab Proc Measurement & Control, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
smoke semantic segmentation; urban smoke scene; dual-branch encoder; transformer; convolutional neural network; NETWORK;
D O I
10.1109/TII.2023.3271441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smoke is an informative indicator of early fire and gas leakage. Segmenting the smoke from images can provide detailed information about the smoke volume, dispersion direction, and source location, which has significant implications considering the proliferation of video surveillance systems in cities. Focusing on smoke segmentation in the urban scene, we designed a dual-branch segmentation model, named SmokeSeger, which couples a Transformer branch and a CNN branch to enhance the representation of both global and local features. To address the lack of real-scene smoke datasets, we built an urban scene smoke segmentation dataset containing 3217 images of fire smoke and exhaust emissions with accurate annotations. Experiments validate that the SmokeSeger outperforms other mainstream segmentation methods on the proposed dataset. Visualization of attention maps reveals that the model could effectively capture the semantic relationship between the smoke and the corresponding source, which benefits the discrimination between smoke and smoke-like objects.
引用
收藏
页码:1385 / 1396
页数:12
相关论文
共 50 条
  • [31] SCSNet: a novel transformer-CNN fusion architecture for enhanced segmentation and classification on high-resolution semiconductor micro-scale defects
    Luo, Yuening
    Mei, Zhouzhouzhou
    Qiao, Yibo
    Chen, Yining
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [32] A Low-Complexity Transformer-CNN Hybrid Model Combining Dynamic Attention for Remote Sensing Image Compression
    Zhang, Lili
    Wang, Xianjun
    Liu, Jiahui
    Fang, Qizhi
    RADIOENGINEERING, 2024, 33 (04) : 642 - 659
  • [33] Enhancing Efficient Global Understanding Network With CSWin Transformer for Urban Scene Images Segmentation
    Zhang, Jie
    Shao, Mingwen
    Qiao, Yuanjian
    Cao, Xiangyong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10230 - 10245
  • [34] Multispectral Fusion Transformer Network for RGB-Thermal Urban Scene Semantic Segmentation
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Huo, Qizheng
    Xie, Yongqiang
    Li, Zhongbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [35] An effective multi-scale interactive fusion network with hybrid Transformer and CNN for smoke image segmentation
    Li, Kang
    Yuan, Feiniu
    Wang, Chunmei
    PATTERN RECOGNITION, 2025, 159
  • [36] A Combined Recognition and Segmentation Model for Urban Traffic Scene Understanding
    Oeljeklaus, Malte
    Hoffmann, Frank
    Bertram, Torsten
    2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [37] A Transformer-CNN Hybrid Model for Cognitive Behavioral Therapy in Psychological Assessment and Intervention for Enhanced Diagnostic Accuracy and Treatment Efficiency
    Vuyyuru, Veera Ankalu.
    Krishna, G. Vamsi
    Mary, S. Suma Christal
    Kayalvili, S.
    Alsubayhay, Abraheem Mohammed Sulayman
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 594 - 602
  • [38] Learning Content-Enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation
    Bi, Qi
    You, Shaodi
    Gevers, Theo
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 819 - 827
  • [39] Emb-trattunet: a novel edge loss function and transformer-CNN architecture for multi-classes pneumonia infection segmentation in low annotation regimes
    Bougourzi, Fares
    Dornaika, Fadi
    Nakib, Amir
    Taleb-Ahmed, Abdelmalik
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)
  • [40] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)