Subspace Clustering Without Knowing the Number of Clusters: A Parameter Free Approach

被引:15
作者
Menon, Vishnu [1 ]
Muthukrishnan, Gokularam [1 ]
Kalyani, Sheetal [1 ]
机构
[1] Indian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
关键词
Clustering algorithms; Signal processing algorithms; Task analysis; Statistical distributions; Tuning; Face; Principal component analysis; Subspace clustering; data clustering; high-dimensional data; union of subspaces; tuning free; Bhattacharyya distance; unsupervised learning; FACE RECOGNITION; SEGMENTATION; ROBUST; ALGORITHM; HYBRID; MODELS;
D O I
10.1109/TSP.2020.3018665
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Subspace clustering, the task of clustering high dimensional data when the data points come from a union of subspaces, is one of the fundamental tasks in unsupervised machine learning. Most of the existing algorithms for this task require prior knowledge of the number of clusters along with few additional parameters which need to be set or tuned apriori according to the type of data to be clustered. In this work, a parameter free method for subspace clustering is proposed, where the data points are clustered on the basis of the difference in the statistical distributions of the angles subtended by the data points within a subspace and those by points belonging to different subspaces. Given an initial fine clustering, the proposed algorithm merges the clusters until a final clustering is obtained. This, unlike many existing methods, does not require the number of clusters apriori. Also, the proposed algorithm does not involve the use of an unknown parameter or tuning for one. A parameter free method for producing a fine initial clustering is also discussed, making the whole process of subspace clustering parameter free. The comparison of the proposed algorithm's performance with that of the existing state-of-the-art techniques in synthetic and real data sets shows the significance of the proposed method.
引用
收藏
页码:5047 / 5062
页数:16
相关论文
共 75 条
[1]  
Achtert E, 2006, LECT NOTES ARTIF INT, V4213, P446
[2]  
Agarwal N, 2005, LECT NOTES COMPUT SC, V3739, P475
[3]  
[Anonymous], 1998, Learning from data: concepts, theory, and methods
[4]  
[Anonymous], 2000, P 17 INT C MACHINE L
[5]  
[Anonymous], 1943, Bulletin of Calcutta Mathematical Society, DOI DOI 10.1038/157869B0
[6]  
[Anonymous], 2017, ADV NEURAL INF PROCE
[7]  
[Anonymous], 2014, Advances in Neural Information Processing Systems
[8]  
[Anonymous], 2017, ARXIV170904744
[9]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[10]  
[Anonymous], 2004, SIGKDD EXPLOR NEWSL, DOI DOI 10.1145/1007730.1007731