ADDP: Anomaly Detection Based on Denoising Pretraining

被引:1
作者
Ge, Xianlei [1 ,2 ]
Li, Xiaoyan [3 ]
Zhang, Zhipeng [1 ]
机构
[1] Huainan Normal Univ, Sch Elect Engn, Huainan, Peoples R China
[2] Natl Univ, Coll Comp & Informat Technol, Manila 1008, Philippines
[3] Huainan Normal Univ, Sch Comp, Huainan 232038, Peoples R China
关键词
Anomaly Detection; Diffusion Models; Image Denoising; Pretraining; Transfer Learning;
D O I
10.24425/ijet.2023.147693
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Acquiring labels in anomaly detection tasks is expensive and challenging. Therefore, as an effective way to improve efficiency, pretraining is widely used in anomaly detection models, which enriches the model's representation capabilities, thereby enhancing both performance and efficiency in anomaly detection. In most pretraining methods, the decoder is typically randomly initialized. Drawing inspiration from the diffusion model, this paper proposed to use denoising as a task to pretrain the decoder in anomaly detection, which is trained to reconstruct the original noise-free input. Denoising requires the model to learn the structure, patterns, and related features of the data, particularly when training samples are limited. This paper explored two approaches on anomaly detection: simultaneous denoising pretraining for encoder and decoder, denoising pretraining for only decoder. Experimental results demonstrate the effectiveness of this method on improving model's performance. Particularly, when the number of samples is limited, the improvement is more pronounced.
引用
收藏
页码:719 / 726
页数:8
相关论文
共 34 条
  • [1] Bachman P, 2019, ADV NEUR IN, V32
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Baid U, 2021, Arxiv, DOI [arXiv:2107.02314, 10.48550/arXiv.2107.02314, DOI 10.48550/ARXIV.2107.02314]
  • [4] Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
    Bond-Taylor, Sam
    Leach, Adam
    Long, Yang
    Willcocks, Chris G.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7327 - 7347
  • [5] Chen J., 2021, arXiv
  • [6] Hjelm RD, 2019, Arxiv, DOI arXiv:1808.06670
  • [7] Dhariwal P, 2021, ADV NEUR IN, V34
  • [8] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [9] A Robust CNN Model for Diagnosis of COVID-19 Based on CT Scan Images and DL Techniques
    Eldeeb, Ahmed H.
    Amr, Mohammed Nagah
    Ibrahim, Amin S.
    Kamel, Hesham
    Fouad, Sara
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2022, 68 (04) : 731 - 739
  • [10] Dermatologist-level classification of skin cancer with deep neural networks
    Esteva, Andre
    Kuprel, Brett
    Novoa, Roberto A.
    Ko, Justin
    Swetter, Susan M.
    Blau, Helen M.
    Thrun, Sebastian
    [J]. NATURE, 2017, 542 (7639) : 115 - +