Multiscale fire image detection method based on CNN and Transformer

被引:1
|
作者
Wu, Shengbao [1 ]
Sheng, Buyun [1 ,2 ]
Fu, Gaocai [1 ]
Zhang, Daode [2 ]
Jian, Yuchao [1 ]
机构
[1] Wuhan Univ Technol, Sch Mech & Elect Engn, Wuhan 430070, Peoples R China
[2] Hubei Univ Technol, Sch Mech Engn, Wuhan 430068, Peoples R China
关键词
Deep learning; Fire detection; CNN; Multiscale feature extraction; Transformer; Hybrid model; Attention mechanism; CONVOLUTIONAL NEURAL-NETWORKS; REAL-TIME FIRE; VIDEO FIRE; COLOR; SURVEILLANCE;
D O I
10.1007/s11042-023-17482-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fire is one of the most harmful hazards that affect daily life. The existing fire detection methods have the problems of large computation, slow detection speed, and low detection accuracy to varying degrees, and do not achieve a better trade-off between model complexity, accuracy, and detection speed. In this paper, a multiscale fire image detection method combining Convolutional Neural Network(CNN) and Transformer is proposed. In the shallow layer of the model, the CNN-based multiscale feature extraction module is used to obtain rich fire image information. In the deep layers of the model, the powerful global learning ability of the Transformer is used to carry out overall perception and macroscopic understanding of images. The experimental results show that the best detection accuracy of the model can reach 94.62%, and the fastest detection speed can reach 158.12FPS, F1 score is stable at around 94%, which is fully capable of real-time and accurate detection of fire. Compared with the existing detection methods, this method has higher detection accuracy under similar model complexity and detection speed. With similar detection accuracy, our method has a faster detection speed. The proposed method achieves a better balance between model complexity, detection speed, and accuracy.
引用
收藏
页码:49787 / 49811
页数:25
相关论文
共 50 条
  • [21] Remote sensing image change detection based on CNN-Transformer structure
    Pan, Mengyang
    Yang, Hang
    Fan, Xianghui
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (10) : 1361 - 1379
  • [22] Rethinking Image Deblurring via CNN-Transformer Multiscale Hybrid Architecture
    Zhao, Qian
    Yang, Hao
    Zhou, Dongming
    Cao, Jinde
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [23] Rethinking Image Deblurring via CNN-Transformer Multiscale Hybrid Architecture
    Zhao, Qian
    Yang, Hao
    Zhou, Dongming
    Cao, Jinde
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [24] TD-Net:unsupervised medical image registration network based on Transformer and CNN
    Song, Lei
    Liu, Guixia
    Ma, Mingrui
    APPLIED INTELLIGENCE, 2022, 52 (15) : 18201 - 18209
  • [25] TD-Net:unsupervised medical image registration network based on Transformer and CNN
    Lei Song
    Guixia Liu
    Mingrui Ma
    Applied Intelligence, 2022, 52 : 18201 - 18209
  • [26] CD-CTFM: A Lightweight CNN-Transformer Network for Remote Sensing Cloud Detection Fusing Multiscale Features
    Ge, Wenxuan
    Yang, Xubing
    Jiang, Rui
    Shao, Wei
    Zhang, Li
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4538 - 4551
  • [27] Metal Defect Image Recognition Method Based on Shallow CNN Fusion Transformer
    Tang D.
    Yang Z.
    Cheng H.
    Liu M.
    Zhou L.
    Ding C.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (19): : 2298 - 2305and2316
  • [28] Multiscale network based on feature fusion for fire disaster detection in complex scenes
    Feng, Jian
    Sun, Yu
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
  • [29] From CNN to Transformer: A Review of Medical Image Segmentation Models
    Yao, Wenjian
    Bai, Jiajun
    Liao, Wei
    Chen, Yuheng
    Liu, Mengjuan
    Xie, Yao
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1529 - 1547
  • [30] Multi-Label Auroral Image Classification Based on CNN and Transformer
    Su, Hang
    Yang, Qiuju
    Ning, Yixuan
    Hu, Zejun
    Liu, Lili
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1835 - 1848