A survey on soft subspace clustering

被引:100
作者
Deng, Zhaohong [1 ,2 ]
Choi, Kup-Sze [3 ]
Jiang, Yizhang [1 ]
Wang, Jun [1 ]
Wang, Shitong [1 ]
机构
[1] Jiangnan Univ, Sch Digital Media, Wuxi, Jiangsu, Peoples R China
[2] Univ Calif Davis, Dept Biomed Engn, Davis, CA 95616 USA
[3] Hong Kong Polytech Univ, Ctr Smart Hlth, Sch Nursing, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Soft subspace clustering; Fuzzy weighting; Entropy weighting; Fuzzy C-means/k-means model; Mixture model; ALGORITHM; FACTORIAL; MODEL;
D O I
10.1016/j.ins.2016.01.101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Subspace clustering (SC) is a promising technology involving clusters that are identified based on their association with subspaces in high-dimensional spaces. SC can be classified into hard subspace clustering (HSC) and soft subspace clustering (SSC). While HSC algorithms have been studied extensively and are well accepted by the scientific community, SSC algorithms are relatively new. However, as they are said to be more adaptable than their HSC counterparts, SSC algorithms have been attracting more attention in recent years. A comprehensive survey of existing SSC algorithms and recent developments in the field are presented in this paper. SSC algorithms have been systematically classified into three main categories: conventional SSC (CSSC), independent SSC (ISSC), and extended SSC (XSSC). The characteristics of these algorithms are highlighted and potential future developments in the area of SSC are discussed. Through a comprehensive review of SSC, this paper aims to provide readers with a clear profile of existing SSC methods and to foster the development of more effective clustering technologies and significant research in this area. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:84 / 106
页数:23
相关论文
共 72 条
[1]  
Aggarwal CC, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P61, DOI 10.1145/304181.304188
[2]  
Aggarwal CC, 2000, SIGMOD REC, V29, P70, DOI 10.1145/335191.335383
[3]  
Agrawal R., 1998, SIGMOD Record, V27, P94, DOI 10.1145/276305.276314
[4]   A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets [J].
Ahmad, Amir ;
Dey, Lipika .
PATTERN RECOGNITION LETTERS, 2011, 32 (07) :1062-1069
[5]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[6]   A novel attribute weighting algorithm for clustering high-dimensional categorical data [J].
Bai, Liang ;
Liang, Jiye ;
Dang, Chuangyin ;
Cao, Fuyuan .
PATTERN RECOGNITION, 2011, 44 (12) :2843-2861
[7]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[8]   Extending Data Reliability Measure to a Filter Approach for Soft Subspace Clustering [J].
Boongoen, Tossapon ;
Shang, Changjing ;
Iam-On, Natthakan ;
Shen, Qiang .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (06) :1705-1714
[9]   A Model-Based Approach for Discrete Data Clustering and Feature Weighting Using MAP and Stochastic Complexity [J].
Bouguila, Nizar .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) :1649-1664
[10]  
Can GJ, 2006, LECT NOTES ARTIF INT, V4093, P271