MemFormer: A memory based unified model for anomaly detection on metro railway tracks

Cited by: 5
Authors
Liu, Ruikang [1]
Liu, Weiming [1]
Duan, Mengfei [1]
Xie, Wei [1]
Dai, Yuan [1]
Liao, Xianzhe [2]
Affiliations
[1] South China Univ Technol, Guangzhou 510641, Peoples R China
[2] Shenzhen Metro Grp Co Ltd, Shenzhen 518000, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Anomaly detection; Unified model; Unsupervised method; Memory module; Self-attention mechanism; Localization
DOI
10.1016/j.eswa.2023.121509
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Anomaly detection on metro railway tracks is crucial to maintaining the safety of train transportation. Unsupervised methods require separate models for different scenarios, rendering them unsuitable for unified detection in the variable trackway scenario. The problem of "abnormal reconstruction" persists in unified detection models: abnormal features are preserved in the output, which hinders the model's ability to recognize anomalies. In addition, real-time capability remains a major challenge. In this study, we present a unified model, MemFormer, that employs a memory module, layer-wise normal queries, and a cascade convolution based multi-head self-attention mechanism (CC-MHSA) to overcome these issues. First, a memory module is constructed to store diverse normal features that aid in the uniform modeling of a decision boundary for various scenarios. Second, we use the normal features stored in the memory module, together with layer-wise normal queries, to optimize the attention mechanism, which suppresses the reconstruction of abnormal features. Third, the proposed CC-MHSA uses cascaded convolution to improve feature representation and reduce parameter size, thereby reshaping the self-attention calculation and reducing model inference time. Under the unified setting, our method achieved detection accuracies of 99.2% and 97.5% and localization accuracies of 99.7% and 97.1% on the metro trackway foreign object anomaly detection dataset and the MVTec-AD dataset, respectively. The model's inference speed is 47.6 FPS, exceeding that of state-of-the-art alternatives.
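The abstract gives no implementation details, so the following is only a minimal sketch of the general idea behind a memory of normal features, not MemFormer itself. It assumes a plain scaled dot-product readout: a feature is rebuilt as a convex combination of stored normal items, so a feature unlike anything in memory cannot be reconstructed faithfully and receives a large reconstruction error. The names memory_readout and anomaly_score, and the toy two-dimensional vectors, are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_readout(query, memory):
    """Rebuild `query` as a convex combination of the stored normal items.

    Assumption: scaled dot-product attention with the memory items acting
    as both keys and values. The paper's actual design may differ.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, item)) / scale for item in memory]
    weights = softmax(scores)
    return [
        sum(w * item[d] for w, item in zip(weights, memory))
        for d in range(len(query))
    ]


def anomaly_score(feature, memory):
    """Euclidean distance between a feature and its memory reconstruction."""
    recon = memory_readout(feature, memory)
    return math.sqrt(sum((f - r) ** 2 for f, r in zip(feature, recon)))


# Toy data (hypothetical): two stored normal prototypes in 2-D.
memory = [[4.0, 0.0], [0.0, 4.0]]
normal = [4.0, 0.0]       # matches a stored prototype
abnormal = [-4.0, -4.0]   # unlike anything in memory

print(anomaly_score(normal, memory))
print(anomaly_score(abnormal, memory))
```

Because the readout can only return points inside the convex hull of the memory items, the abnormal input is pulled toward the normal prototypes and its score is much larger than that of the normal input. This illustrates, under the stated assumptions, why restricting reconstruction to stored normal features can suppress "abnormal reconstruction".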
Pages: 15