MAMA Net: Multi-Scale Attention Memory Autoencoder Network for Anomaly Detection

被引:30
作者
Chen, Yurong [1 ]
Zhang, Hui [1 ]
Wang, Yaonan [1 ]
Yang, Yimin [2 ]
Zhou, Xianen [1 ]
Wu, Q. M. Jonathan [3 ]
机构
[1] Hunan Univ, Sch Robot, Natl Engn Lab Robot Visual Percept & Control Tech, Changsha 410082, Hunan, Peoples R China
[2] Lakehead Univ, Coll Comp Sci, Thunder Bay, ON P7B 5E1, Canada
[3] Univ Windsor, Coll Elect & Comp Engn, Windsor, ON N9B 3P4, Canada
基金
中国国家自然科学基金;
关键词
COVID-19; Image reconstruction; Anomaly detection; Memory modules; Training; Feature extraction; Computed tomography; diagnose; attention mechanism; hash coding; memory autoencoder; NEURAL-NETWORKS; COVID-19; SEGMENTATION; DIAGNOSIS;
D O I
10.1109/TMI.2020.3045295
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Anomaly detection refers to the identification of cases that do not conform to the expected pattern, which takes a key role in diverse research areas and application domains. Most of existing methods can be summarized as anomaly object detection-based and reconstruction error-based techniques. However, due to the bottleneck of defining encompasses of real-world high-diversity outliers and inaccessible inference process, individually, most of them have not derived groundbreaking progress. To deal with those imperfectness, and motivated by memory-based decision-making and visual attention mechanism as a filter to select environmental information in human vision perceptual system, in this paper, we propose a Multi-scale Attention Memory with hash addressing Autoencoder network (MAMA Net) for anomaly detection. First, to overcome a battery of problems result from the restricted stationary receptive field of convolution operator, we coin the multi-scale global spatial attention block which can be straightforwardly plugged into any networks as sampling, upsampling and downsampling function. On account of its efficient features representation ability, networks can achieve competitive results with only several level blocks. Second, it's observed that traditional autoencoder can only learn an ambiguous model that also reconstructs anomalies "well" due to lack of constraints in training and inference process. To mitigate this challenge, we design a hash addressing memory module that proves abnormalities to produce higher reconstruction error for classification. In addition, we couple the mean square error (MSE) with Wasserstein loss to improve the encoding data distribution. Experiments on various datasets, including two different COVID-19 datasets and one brain MRI (RIDER) dataset prove the robustness and excellent generalization of the proposed MAMA Net.
引用
收藏
页码:1032 / 1041
页数:10
相关论文
共 42 条
  • [1] Latent Space Autoregression for Novelty Detection
    Abati, Davide
    Porrello, Angelo
    Calderara, Simone
    Cucchiara, Rita
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 481 - 490
  • [2] Two-phase multi-model automatic brain tumour diagnosis system from magnetic resonance images using convolutional neural networks
    Abd-Ellah, Mahmoud Khaled
    Awad, Ali Ismail
    Khalaf, Ashraf A. M.
    Hamed, Hesham F. A.
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
  • [3] The attentive brain: insights from developmental cognitive neuroscience
    Amso, Dima
    Scerif, Gaia
    [J]. NATURE REVIEWS NEUROSCIENCE, 2015, 16 (10) : 606 - 619
  • [4] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [5] Chen YQ, 2001, IEEE IMAGE PROC, P34, DOI 10.1109/ICIP.2001.958946
  • [6] Cohen J.P, 2020, arXiv preprint arXiv:2006.11988
  • [7] Edgeworth F., 1887, PHIL MAG J SCI, V23, P364, DOI [DOI 10.1080/14786448708628471, 10.1080/14786448708628471doi.org/10.1080/14786448708628471]
  • [8] High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning
    Erfani, Sarah M.
    Rajasegarar, Sutharshan
    Karunasekera, Shanika
    Leckie, Christopher
    [J]. PATTERN RECOGNITION, 2016, 58 : 121 - 134
  • [9] Inf-Net: Automatic COVID-19 Lung Infection Segmentation From CT Images
    Fan, Deng-Ping
    Zhou, Tao
    Ji, Ge-Peng
    Zhou, Yi
    Chen, Geng
    Fu, Huazhu
    Shen, Jianbing
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (08) : 2626 - 2637
  • [10] Video Captioning With Attention-Based LSTM and Semantic Consistency
    Gao, Lianli
    Guo, Zhao
    Zhang, Hanwang
    Xu, Xing
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2045 - 2055