Fire Detection Approach Based on Vision Transformer

被引:7
|
作者
Khudayberdiev, Otabek [1 ]
Zhang, Jiashu [1 ]
Elkhalil, Ahmed [1 ]
Balde, Lansana [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
来源
ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I | 2022年 / 13338卷
关键词
Vision transformer; Self-attention; Convolutional neural networks; Fire detection; Image classification; CONVOLUTIONAL NEURAL-NETWORKS; SURVEILLANCE;
D O I
10.1007/978-3-031-06794-5_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the rapid development of embedding surveillance video systems for fire monitoring, we need to distribute systems with high accuracy and detection speed. Recent progress in vision-based fire detection techniques achieved remarkable success by the powerful ability of deep convolutional neural networks. CNN's have long been the architecture of choice for computer vision tasks. However, current CNN-based methods consider fire classification entire image pixels as equal, ignoring regardless of information. Thus, this can cause a low accuracy rate and delay detection. To increase detection speed and achieve high accuracy, we propose a fire detection approach based on Vision Transformer as a viable alternative to CNN. Different from convolutional networks, transformers operate with images as a sequence of patches, selectively attending to different image parts based on context. In addition, the attention mechanism in the transformer solves the problem with a small flame, thereby provide detection fire in the early stage. Since transformers using global self-attention, which conducts complex computing, we utilize fine-tuned Swin Transformer as our backbone architecture that computes self-attention with local windows. Thus, solving the classification problems with high-resolution images. Experimental results conducted on the image fire dataset demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, Vision Transformer obtains a classification accuracy of 98.54% on the publicly available dataset.
引用
收藏
页码:41 / 53
页数:13
相关论文
共 50 条
  • [31] A vision-based system for early fire detection
    Santana, Pedro
    Gomes, Pedro
    Barata, Jose
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 739 - 744
  • [32] Efficient Fire Detection for Uncertain Surveillance Environment
    Muhammad, Khan
    Khan, Selman
    Elhoseny, Mohamed
    Ahmed, Syed Hassan
    Baik, Sung Wook
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (05) : 3113 - 3122
  • [33] Static hand gesture recognition method based on the Vision Transformer
    Zhang, Yu
    Wang, Junlin
    Wang, Xin
    Jing, Haonan
    Sun, Zhanshuo
    Cai, Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (20) : 31309 - 31328
  • [34] Vision Transformer based Intelligent Parking System for Smart Cities
    Sadek, Rowayda A.
    Khalifa, Alaa A.
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [35] A novel approach for melanoma detection utilizing GAN synthesis and vision transformer
    Wang R.
    Chen X.
    Wang X.
    Wang H.
    Qian C.
    Yao L.
    Zhang K.
    Computers in Biology and Medicine, 2024, 176
  • [36] A survey of the vision transformers and their CNN-transformer based variants
    Khan, Asifullah
    Raufu, Zunaira
    Sohail, Anabia
    Khan, Abdul Rehman
    Asif, Hifsa
    Asif, Aqsa
    Farooq, Umair
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL3) : S2917 - S2970
  • [37] A Systematic Literature Review of Vision-Based Fire Detection, Prediction and Forecasting
    Ismail, Norisza Dalila
    Ramli, Rizauddin
    Ab Rahman, Mohd Nizam
    JURNAL KEJURUTERAAN, 2025, 37 (01):
  • [38] Multiscale fire image detection method based on CNN and Transformer
    Shengbao Wu
    Buyun Sheng
    Gaocai Fu
    Daode Zhang
    Yuchao Jian
    Multimedia Tools and Applications, 2024, 83 : 49787 - 49811
  • [39] A vision transformer based CNN for underwater image enhancement ViTClarityNet
    Mohamed E. Fathy
    Samer A. Mohamed
    Mohammed I. Awad
    Hossam E. Abd El Munim
    Scientific Reports, 15 (1)
  • [40] A survey of the vision transformers and their CNN-transformer based variants
    Asifullah Khan
    Zunaira Rauf
    Anabia Sohail
    Abdul Rehman Khan
    Hifsa Asif
    Aqsa Asif
    Umair Farooq
    Artificial Intelligence Review, 2023, 56 : 2917 - 2970