Clustering Analysis via Deep Generative Models With Mixture Models

被引:35
作者
Yang, Lin [1 ]
Fan, Wentao [1 ]
Bouguila, Nizar [2 ]
机构
[1] Huaqiao Univ, Dept Comp Sci, Xiamen 361021, Peoples R China
[2] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ H3G 1T7, Canada
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; Gallium nitride; Generators; Training; Clustering algorithms; Data models; Decoding; Clustering; generative adversarial network (GAN); mixture models; student's-t mixture model; variational autoencoder (AE); variational inference; Wasserstein GAN; FEATURE-SELECTION;
D O I
10.1109/TNNLS.2020.3027761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a fundamental problem that frequently arises in many fields, such as pattern recognition, data mining, and machine learning. Although various clustering algorithms have been developed in the past, traditional clustering algorithms with shallow structures cannot excavate the interdependence of complex data features in latent space. Recently, deep generative models, such as autoencoder (AE), variational AE (VAE), and generative adversarial network (GAN), have achieved remarkable success in many unsupervised applications thanks to their capabilities for learning promising latent representations from original data. In this work, first we propose a novel clustering approach based on both Wasserstein GAN with gradient penalty (WGAN-GP) and VAE with a Gaussian mixture prior. By combining the WGAN-GP with VAE, the generator of WGAN-GP is formulated by drawing samples from the probabilistic decoder of VAE. Moreover, to provide more robust clustering and generation performance when outliers are encountered in data, a variant of the proposed deep generative model is developed based on a Student's-t mixture prior. The effectiveness of our deep generative models is validated though experiments on both clustering analysis and samples generation. Through the comparison with other state-of-art clustering approaches based on deep generative models, the proposed approach can provide more stable training of the model, improve the accuracy of clustering, and generate realistic samples.
引用
收藏
页码:340 / 350
页数:11
相关论文
共 48 条
[1]  
[Anonymous], 2015, INT C LEARNING REPRE
[2]  
[Anonymous], 1996, Tech. Rep. CUCS-006-96
[3]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[4]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[5]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[6]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[7]  
Bouguila N, 2007, IEEE T PATTERN ANAL, V29, P1716, DOI [10.1109/TPAMI.2007.1095, 10.1109/TPAMl.2007.1095]
[8]  
Chen X, 2016, ADV NEUR IN, V29
[9]  
Denton E, 2015, ADV NEUR IN, V28
[10]  
Dilokthanakul N, 2016, P 5 INT C LEARNING R