Automatic aspect discrimination in data clustering

被引:10
|
作者
Horta, Danilo [1 ]
Campello, Ricardo J. G. B. [1 ]
机构
[1] Univ Sao Paulo, ICMC, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Clustering; Aspect discrimination; Attribute weighting; Cluster validation; FUZZY EXTENSION; RELATIONAL DATA; VALIDITY; AGGREGATION; VALIDATION; ALGORITHMS; COMPLEXITY; CRITERION; INDEXES;
D O I
10.1016/j.patcog.2012.05.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4370 / 4388
页数:19
相关论文
共 50 条
  • [1] Automatic similarity detection and clustering of data
    Einstein, Craig
    Chin, Peter
    CYBER SENSING 2017, 2017, 10185
  • [2] Automatic clustering of hyperspectral data
    Salomon, R.
    Dolberg, S.
    Rotman, S. R.
    2006 IEEE 24TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL, 2006, : 334 - +
  • [3] Elastic Differential Evolution for Automatic Data Clustering
    Chen, Jun-Xian
    Gong, Yue-Jiao
    Chen, Wei-Neng
    Li, Mengting
    Zhang, Jun
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (08) : 4134 - 4147
  • [4] Automatic Subspace Clustering of High Dimensional Data
    Rakesh Agrawal
    Johannes Gehrke
    Dimitrios Gunopulos
    Prabhakar Raghavan
    Data Mining and Knowledge Discovery, 2005, 11 : 5 - 33
  • [5] A Bacterial Evolutionary Algorithm for Automatic Data Clustering
    Das, Swagatam
    Chowdhury, Archana
    Abraham, Ajith
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 2403 - +
  • [6] Automatic subspace clustering of high dimensional data
    Agrawal, R
    Gehrke, J
    Gunopulos, D
    Raghavan, P
    DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (01) : 5 - 33
  • [7] A survey of cluster validity indices for automatic data clustering using differential evolution
    Jose-Garcia, Adan
    Gomez-Flores, Wilfrido
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 314 - 322
  • [8] Categorical Data Clustering with Automatic Selection of Cluster Number
    Liao, Hai-Yong
    Ng, Michael K.
    FUZZY INFORMATION AND ENGINEERING, 2009, 1 (01) : 5 - 25
  • [9] An Automatic Data Clustering Algorithm based on Differential Evolution
    Tsai, Chun-Wei
    Tai, Chiech-An
    Chiang, Ming-Chao
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 794 - 799
  • [10] Automatic Generation of Merge Factor for Clustering Microarray Data
    Pavan, K. Karteeka
    Rao, Allam Appa
    Rao, A. V. Dattatreya
    Sridhar, G. R.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (09): : 127 - 131