ADDP: Anomaly Detection Based on Denoising Pretraining

被引：1

作者：

Ge, Xianlei ^{[1
,2
]}

Li, Xiaoyan ^{[3
]}

Zhang, Zhipeng ^{[1
]}

机构：

[1] Huainan Normal Univ, Sch Elect Engn, Huainan, Peoples R China

[2] Natl Univ, Coll Comp & Informat Technol, Manila 1008, Philippines

[3] Huainan Normal Univ, Sch Comp, Huainan 232038, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS | 2023年 / 69卷 / 04期

关键词：

Anomaly Detection; Diffusion Models; Image Denoising; Pretraining; Transfer Learning;

D O I：

10.24425/ijet.2023.147693

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Acquiring labels in anomaly detection tasks is expensive and challenging. Therefore, as an effective way to improve efficiency, pretraining is widely used in anomaly detection models, which enriches the model's representation capabilities, thereby enhancing both performance and efficiency in anomaly detection. In most pretraining methods, the decoder is typically randomly initialized. Drawing inspiration from the diffusion model, this paper proposed to use denoising as a task to pretrain the decoder in anomaly detection, which is trained to reconstruct the original noise-free input. Denoising requires the model to learn the structure, patterns, and related features of the data, particularly when training samples are limited. This paper explored two approaches on anomaly detection: simultaneous denoising pretraining for encoder and decoder, denoising pretraining for only decoder. Experimental results demonstrate the effectiveness of this method on improving model's performance. Particularly, when the number of samples is limited, the improvement is more pronounced.

引用

页码：719 / 726

页数：8

共 34 条

[1] Bachman P, 2019, ADV NEUR IN, V32
[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[3] Baid U, 2021, Arxiv, DOI [arXiv:2107.02314, 10.48550/arXiv.2107.02314, DOI 10.48550/ARXIV.2107.02314]
[4] Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Bond-Taylor, Sam
Leach, Adam
Long, Yang
Willcocks, Chris G.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7327 - 7347
[5] Chen J., 2021, arXiv
[6] Hjelm RD, 2019, Arxiv, DOI arXiv:1808.06670
[7] Dhariwal P, 2021, ADV NEUR IN, V34
[8] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[9] A Robust CNN Model for Diagnosis of COVID-19 Based on CT Scan Images and DL Techniques
Eldeeb, Ahmed H.
Amr, Mohammed Nagah
Ibrahim, Amin S.
Kamel, Hesham
Fouad, Sara
[J]. INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2022, 68 (04) : 731 - 739
[10] Dermatologist-level classification of skin cancer with deep neural networks
Esteva, Andre
Kuprel, Brett
Novoa, Roberto A.
Ko, Justin
Swetter, Susan M.
Blau, Helen M.
Thrun, Sebastian
[J]. NATURE, 2017, 542 (7639) : 115 - +

← 1 2 3 4 →