Model-Based Method for Projective Clustering

被引:22
作者
Chen, Lifei [1 ]
Jiang, Qingshan [2 ]
Wang, Shengrui [3 ]
机构
[1] Fujian Normal Univ, Sch Math & Comp Sci, Fuzhou 350108, Fujian, Peoples R China
[2] Chinese Acad Sci, Shenzhen Key Lab High Performance Data Min, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Univ Sherbrooke, Dept Comp Sci, Sherbrooke, PQ J1K 2R1, Canada
基金
中国国家自然科学基金; 加拿大自然科学与工程研究理事会;
关键词
Clustering; high dimensions; projective clustering; probability model; ALGORITHM;
D O I
10.1109/TKDE.2010.256
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering high-dimensional data is a major challenge due to the curse of dimensionality. To solve this problem, projective clustering has been defined as an extension to traditional clustering that attempts to find projected clusters in subsets of the dimensions of a data space. In this paper, a probability model is first proposed to describe projected clusters in high-dimensional data space. Then, we present a model-based algorithm for fuzzy projective clustering that discovers clusters with overlapping boundaries in various projected subspaces. The suitability of the proposal is demonstrated in an empirical study done with synthetic data set and some widely used real-world data set.
引用
收藏
页码:1291 / 1305
页数:15
相关论文
共 47 条
[1]  
Agarwal P. K., 2004, P 23 ACM SIGMOD SIGA, P155
[2]   Redefining clustering for high-dimensional applications [J].
Aggarwal, CC ;
Yu, PS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (02) :210-225
[3]  
Aggarwal CC, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P61, DOI 10.1145/304181.304188
[4]   Automatic subspace clustering of high dimensional data [J].
Agrawal, R ;
Gehrke, J ;
Gunopulos, D ;
Raghavan, P .
DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (01) :5-33
[5]  
Androutsopoulos I., 2000, P EUR C MACH LEARN, P9
[6]  
[Anonymous], 2019, ENTROPY BASED SUBSPA, DOI DOI 10.1145/312129.312199
[7]  
[Anonymous], 2006, P 32 INT C VER LARG
[8]  
[Anonymous], 2004, SIGKDD EXPLOR, DOI DOI 10.1145/1007730.1007731
[9]  
[Anonymous], 2006, C EM ANT CEAS
[10]   Density connected clustering with local subspace preferences [J].
Böhm, C ;
Kailing, K ;
Kriegel, HP ;
Kröger, P .
FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, :27-34