Deep multi-sphere support vector data description based on disentangled representation learning

被引：0

作者：

Xing, Hong-Jie ^{[1
,2
]}

Wu, Hui-Nan ^{[3
]}

Zhang, Ping-Ping ^{[4
]}

机构：

[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071000, Peoples R China

[2] Hebei Univ, Sch Math & Informat Sci, Hebei Key Lab Machine Learning & Computat Intellig, Baoding 071002, Peoples R China

[3] Hebei Meteorol Bur, Hebei Meteorol Informat Ctr, Shijiazhuang 050021, Peoples R China

[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 156卷

基金：

中国国家自然科学基金;

关键词：

Deep support vector data description; Disentangled representation learning; Variational autoencoder; Hypersphere collapse; Anomaly detection; MODEL; SVDD;

D O I：

10.1016/j.patcog.2024.110842

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep support vector data description (Deep SVDD) combines deep mapping network and support vector data description (SVDD) to jointly optimize network connection weights and hypersphere volume. However, when the parameters of deep mapping network are set improperly, Deep SVDD may face the problem of hypersphere collapse, where all input data are mapped as the hypersphere center. To overcome the hypersphere collapse problem of Deep SVDD and improve the feature learning ability of deep mapping network, deep multi-sphere SVDD based on disentangled representation learning (DMSVDD-DRL) is proposed. DMSVDD-DRL consists of a variational autoencoder (VAE) and multiple hyperspheres. The feature representations obtained by VAE are disentangled into discriminative representations and generative representations that obey mixture t-distribution and Gaussian distribution, respectively. In the pre-training phase of DMSVDD-DRL, the network parameters and the hypersphere centers are initialized. In the training phase, the augmented data are added into the training set. The discriminative representations of both the input and augmented data are generated through the mapping network. Furthermore, multiple hyperspheres are constructed by the obtained discriminative representations in the feature space. Finally, the VAE loss of the input data, the reconstruction error of the augmented data, the augmentation loss between the input and augmented data, the average radius of the multiple hyperspheres, and the average distance from discriminative representations to their corresponding hypersphere centers are jointly minimized to obtain the optimal network connection weights and the multiple minimum volume hyperspheres. The effectiveness of the proposed DMSVDD-DRL is validated through the comparative and ablation experiments on the benchmark data sets. In addition, it is verified that DMSVDD-DRL is more robust against outliers in comparison with its related methods.

引用

页数：12

共 29 条

[21] Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations [J].

Yang, Lin ;

Fan, Wentao ;

Bouguila, Nizar .

KNOWLEDGE-BASED SYSTEMS, 2022, 246

[22] Clustering Analysis via Deep Generative Models With Mixture Models [J].

Yang, Lin ;

Fan, Wentao ;

Bouguila, Nizar .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) :340-350

[23] Self-Supervised Adversarial Variational Learning [J].

Ye, Fei ;

Bors, Adrian. G. .

PATTERN RECOGNITION, 2024, 148

[24]

Zhang Boqiang, 2024, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), P28358, DOI 10.1109/CVPR52733.2024.02679

[25]

Zhang Y, 2019, 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), P2403, DOI 10.1109/SSCI44817.2019.9002714

[26] Anomaly detection using improved deep SVDD model with data structure preservation [J].

Zhang, Zheng ;

Deng, Xiaogang .

PATTERN RECOGNITION LETTERS, 2021, 148 :1-6

[27]

Zheng Ding, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P7917, DOI 10.1109/CVPR42600.2020.00794

[28] VAE-based Deep SVDD for anomaly detection [J].

Zhou, Yu ;

Liang, Xiaomin ;

Zhang, Wei ;

Zhang, Linrang ;

Song, Xing .

NEUROCOMPUTING, 2021, 453 :131-140

[29]

Zong B, 2018, INT C LEARN REPR

← 1 2 3 →