Fire Detection Approach Based on Vision Transformer

Cited by: 7
Authors
Khudayberdiev, Otabek [1]
Zhang, Jiashu [1]
Elkhalil, Ahmed [1]
Balde, Lansana [1]
Affiliations
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
Source
ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I | 2022 / Vol. 13338
Keywords
Vision transformer; Self-attention; Convolutional neural networks; Fire detection; Image classification; CONVOLUTIONAL NEURAL-NETWORKS; SURVEILLANCE;
DOI
10.1007/978-3-031-06794-5_4
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Considering the rapid development of embedded video surveillance systems for fire monitoring, there is a need to deploy systems with high accuracy and detection speed. Recent progress in vision-based fire detection techniques has achieved remarkable success thanks to the powerful ability of deep convolutional neural networks (CNNs). CNNs have long been the architecture of choice for computer vision tasks. However, current CNN-based methods for fire classification treat all image pixels as equal, regardless of the information they carry. This can cause a low accuracy rate and delayed detection. To increase detection speed and achieve high accuracy, we propose a fire detection approach based on Vision Transformer as a viable alternative to CNNs. Unlike convolutional networks, transformers operate on images as a sequence of patches, selectively attending to different image parts based on context. In addition, the attention mechanism in the transformer addresses the problem of small flames, thereby enabling fire detection at an early stage. Since transformers that use global self-attention are computationally expensive, we utilize a fine-tuned Swin Transformer as our backbone architecture, which computes self-attention within local windows and thus makes classification of high-resolution images tractable. Experimental results on the fire image dataset demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, Vision Transformer obtains a classification accuracy of 98.54% on the publicly available dataset.
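The abstract's cost argument for local windows can be illustrated with a small counting sketch. It is not taken from the paper. The function names are hypothetical, and the 224 x 224 input, 4 x 4 patch size, and 7 x 7 window used in the example are assumed values that the abstract does not state. The sketch only counts how many query-key pairs each attention scheme would score in one layer.

```python
def num_tokens(height, width, patch):
    """Number of non-overlapping patch tokens for an image of the given size."""
    if height % patch or width % patch:
        raise ValueError("image size must be divisible by the patch size")
    return (height // patch) * (width // patch)


def global_attention_pairs(height, width, patch):
    """Query-key pairs when every patch token attends to every other token."""
    n = num_tokens(height, width, patch)
    return n * n


def windowed_attention_pairs(height, width, patch, window):
    """Query-key pairs when tokens attend only within window x window groups."""
    rows, cols = height // patch, width // patch
    if height % patch or width % patch or rows % window or cols % window:
        raise ValueError("token grid must be divisible by the window size")
    n_windows = (rows // window) * (cols // window)
    tokens_per_window = window * window
    return n_windows * tokens_per_window ** 2


# Assumed example values (not from the abstract): 224x224 image, 4x4 patches, 7x7 windows.
full = global_attention_pairs(224, 224, 4)
local = windowed_attention_pairs(224, 224, 4, 7)
print(full, local, full // local)
```

Under these assumed sizes the windowed count grows linearly with the number of tokens, while the global count grows quadratically, which is the reason the abstract gives for choosing a window-based backbone for high-resolution images.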
Pages: 41 - 53
Number of pages: 13