Fire Detection Approach Based on Vision Transformer

Cited by: 7
Authors
Khudayberdiev, Otabek [1]
Zhang, Jiashu [1]
Elkhalil, Ahmed [1]
Balde, Lansana [1]
Affiliation
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
Source
ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I | 2022 / Vol. 13338
Keywords
Vision transformer; Self-attention; Convolutional neural networks; Fire detection; Image classification; CONVOLUTIONAL NEURAL-NETWORKS; SURVEILLANCE;
DOI
10.1007/978-3-031-06794-5_4
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Given the rapid development of embedded video surveillance systems for fire monitoring, deployed systems need both high accuracy and high detection speed. Recent vision-based fire detection techniques have achieved remarkable success thanks to the representational power of deep convolutional neural networks (CNNs), which have long been the architecture of choice for computer vision tasks. However, current CNN-based methods treat all image pixels as equally important for fire classification, regardless of the information they carry. This can lead to low accuracy and delayed detection. To increase detection speed and achieve high accuracy, we propose a fire detection approach based on the Vision Transformer as a viable alternative to CNNs. Unlike convolutional networks, transformers process an image as a sequence of patches, selectively attending to different image parts based on context. In addition, the attention mechanism in the transformer addresses the problem of small flames, thereby enabling fire detection at an early stage. Because transformers use global self-attention, which is computationally expensive, we use a fine-tuned Swin Transformer as our backbone architecture; it computes self-attention within local windows, which makes classification of high-resolution images tractable. Experimental results on an image fire dataset demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, the Vision Transformer obtains a classification accuracy of 98.54% on the publicly available dataset.
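The two ideas the abstract relies on, treating an image as a sequence of patches and restricting self-attention to local windows, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names are hypothetical, and the image size (224), patch size (4), and window size (7) are illustrative assumptions that the abstract does not state.

```python
import numpy as np

def patchify(image, patch):
    """Split an (H, W, C) image into a sequence of flattened, non-overlapping patches."""
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0, "image must tile evenly into patches"
    gh, gw = h // patch, w // patch
    x = image.reshape(gh, patch, gw, patch, c)
    x = x.transpose(0, 2, 1, 3, 4)            # (gh, gw, patch, patch, c)
    return x.reshape(gh * gw, patch * patch * c)

def window_partition(tokens, grid_h, grid_w, window):
    """Group a (grid_h * grid_w, D) token sequence into local windows of window*window tokens."""
    d = tokens.shape[-1]
    x = tokens.reshape(grid_h // window, window, grid_w // window, window, d)
    x = x.transpose(0, 2, 1, 3, 4)            # (windows_h, windows_w, window, window, d)
    return x.reshape(-1, window * window, d)

def attention_pairs(num_tokens, window_tokens=None):
    """Count query-key pairs: global attention if window_tokens is None, else windowed."""
    if window_tokens is None:
        return num_tokens ** 2
    return (num_tokens // window_tokens) * window_tokens ** 2

# Assumed, illustrative sizes (not taken from the abstract).
image = np.zeros((224, 224, 3), dtype=np.float32)
tokens = patchify(image, patch=4)                        # one token per 4x4 patch
windows = window_partition(tokens, 56, 56, window=7)     # attention stays inside each window
global_pairs = attention_pairs(tokens.shape[0])
local_pairs = attention_pairs(tokens.shape[0], windows.shape[1])
```

Under these assumed sizes, comparing `global_pairs` with `local_pairs` shows why windowed attention scales better: the global count grows quadratically with the number of patches, while the windowed count grows linearly for a fixed window size.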
Pages: 41-53
Number of pages: 13