Support Vector Data Descriptions and k-Means Clustering: One Class?

被引:30
作者
Goernitz, Nico [1 ]
Lima, Luiz Alberto [2 ,3 ]
Mueller, Klaus-Robert [1 ,4 ,5 ]
Kloft, Marius [6 ]
Nakajima, Shinichi [1 ]
机构
[1] Berlin Inst Technol, Machine Learning Grp, D-10587 Berlin, Germany
[2] Pontifical Catholic Univ Rio de Janeiro, BR-22543900 Rio De Janeiro, Brazil
[3] Petrobras SA, BR-20031912 Rio De Janeiro, Brazil
[4] Korea Univ, Dept Brain & Cognit Engn, Seoul 136713, South Korea
[5] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[6] Humboldt Univ, Dept Comp Sci, Machine Learning Grp, D-12489 Berlin, Germany
基金
新加坡国家研究基金会;
关键词
Anomaly detection; clustering; k-means; one-class classification; support vector data description (SVDD); KERNEL; SVMS;
D O I
10.1109/TNNLS.2017.2737941
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present ClusterSVDD, a methodology that unifies support vector data descriptions (SVDDs) and k-means clustering into a single formulation. This allows both methods to benefit from one another, i.e., by adding flexibility using multiple spheres for SVDDs and increasing anomaly resistance and flexibility through kernels to k-means. In particular, our approach leads to a new interpretation of k-means as a regularized mode seeking algorithm. The unifying formulation further allows for deriving new algorithms by transferring knowledge from one-class learning settings to clustering settings and vice versa. As a showcase, we derive a clustering method for structured data based on a one-class learning scenario. Additionally, our formulation can be solved via a particularly simple optimization scheme. We evaluate our approach empirically to highlight some of the proposed benefits on artificially generated data, as well as on real-world problems, and provide a PYTHON software package comprising various implementations of primal and dual SVDD as well as our proposed ClusterSVDD.
引用
收藏
页码:3994 / 4006
页数:13
相关论文
共 54 条
[51]   Theoretical analysis for solution of support vector data description [J].
Wang, Xiaoming ;
Chung, Fu-lai ;
Wang, Shitong .
NEURAL NETWORKS, 2011, 24 (04) :360-369
[52]   Multi-sphere Support Vector Data Description for Outliers Detection on Multi-distribution Data [J].
Xiao, Yanshan ;
Liu, Bo ;
Cao, Longbing ;
Wu, Xindong ;
Zhang, Chengqi ;
Hao, Zhifeng ;
Yang, Fengzhao ;
Cao, Jie .
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, :82-+
[53]  
Y. Kondo, 2011, THESIS
[54]   The concave-convex procedure [J].
Yuille, AL ;
Rangarajan, A .
NEURAL COMPUTATION, 2003, 15 (04) :915-936