Balanced Self-Paced Learning for Generative Adversarial Clustering Network

被引：80

作者：

Dizaji, Kamran Ghasedi ^{[1
]}

Wang, Xiaoqian ^{[1
]}

Deng, Cheng ^{[2
]}

Huang, Heng ^{[1
,3
]}

机构：

[1] Univ Pittsburgh, Elect & Comp Engn Dept, Pittsburgh, PA 15260 USA

[2] Xidian Univ, Sch Elect Engn, Xian, Shaanxi, Peoples R China

[3] JD Digits, Beijing, Peoples R China

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00452

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Clustering is an important problem in various machine learning applications, but still a challenging task when dealing with complex real data. The existing clustering algorithms utilize either shallow models with insufficient capacity for capturing the non-linear nature of data, or deep models with large number of parameters prone to overfitting. In this paper, we propose a deep Generative Adversarial Clustering Network (ClusterGAN), which tackles the problems of training of deep clustering models in unsupervised manner. ClusterGAN consists of three networks, a discriminator, a generator and a clusterer (i.e. a clustering network). We employ an adversarial game between these three players to synthesize realistic samples given discriminative latent variables via the generator, and learn the inverse mapping of the real samples to the discriminative embedding space via the clusterer. Moreover, we utilize a conditional entropy minimization loss to increase/decrease the similarity of intra/inter cluster samples. Since the ground-truth similarities are unknown in clustering task, we propose a novel balanced self-paced learning algorithm to gradually include samples into training from easy to difficult, while considering the diversity of selected samples from all clusters. Our unsupervised learning framework makes it possible to efficiently train clusterers with large depth. Experimental results indicate that ClusterGAN achieves competitive results compared to the state-of-the-art models on several datasets.

引用

页码：4386 / 4395

页数：10

共 61 条

[1]

[Anonymous], 2011, AAAI

[2]

[Anonymous], 2016, THEANO PYTHON FRAMEW

[3]

[Anonymous], ADV NEURAL INFORM PR

[4]

[Anonymous], P IEEE INT C COMP VI

[5]

[Anonymous], 2016, INT C MACH LEARN

[6]

[Anonymous], 2017, Advances in Neural Information Processing Systems (NIPS)

[7]

[Anonymous], 2015, ARXIV PREPRINT ARXIV

[8]

[Anonymous], 2016, ARXIV161009585

[9]

[Anonymous], ARXIV170208798

[10]

[Anonymous], 2015, CoRR

← 1 2 3 4 5 6 7 →