Automatic aspect discrimination in data clustering

被引:10
|
作者
Horta, Danilo [1 ]
Campello, Ricardo J. G. B. [1 ]
机构
[1] Univ Sao Paulo, ICMC, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Clustering; Aspect discrimination; Attribute weighting; Cluster validation; FUZZY EXTENSION; RELATIONAL DATA; VALIDITY; AGGREGATION; VALIDATION; ALGORITHMS; COMPLEXITY; CRITERION; INDEXES;
D O I
10.1016/j.patcog.2012.05.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4370 / 4388
页数:19
相关论文
共 50 条
  • [31] Evaluation of Clustering Results in the Aspect of Information Theory
    Grabusts, Peter
    2020 61ST INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2020,
  • [32] Automatic Clustering of Code Changes
    Kreutzer, Patrick
    Dotzler, Georg
    Ring, Matthias
    Eskofier, Bjoern M.
    Philippsen, Michael
    13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 61 - 72
  • [33] Automatic clustering of faces in meetings
    Vallespi, Carlos
    De la Torre, Fernando
    Veloso, Manuela
    Kanade, Takeo
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1841 - +
  • [34] Automatic Bitcoin Address Clustering
    Ermilov, Dmitry
    Panov, Maxim
    Yanovich, Yury
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 461 - 466
  • [35] An Automatic Index Validity for Clustering
    Fan, Zizhu
    Jiang, Xiangang
    Xu, Baogen
    Jiang, Zhaofeng
    ADVANCES IN SWARM INTELLIGENCE, PT 2, PROCEEDINGS, 2010, 6146 : 359 - +
  • [36] Fully automatic clustering system
    Patanè, G
    Russo, M
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (06): : 1285 - 1298
  • [37] Dual-Graph-Regularization Constrained Nonnegative Matrix Factorization with Label Discrimination for Data Clustering
    Li, Jie
    Li, Yaotang
    Li, Chaoqian
    MATHEMATICS, 2024, 12 (01)
  • [38] Automatic geobody detection from seismic data using minimum message length clustering
    Xu, Y
    Caers, J
    Arroyo-Garcia, C
    COMPUTERS & GEOSCIENCES, 2004, 30 (07) : 741 - 751
  • [39] Automatic Data Clustering Framework Using Nature-Inspired Binary Optimization Algorithms
    Merikhi, Behnaz
    Soleymani, M. R.
    IEEE ACCESS, 2021, 9 : 93703 - 93722
  • [40] Automatic Inspection for Wafer Defect Pattern Recognition with Unsupervised Clustering
    Li, Katherine Shu-Min
    Chen, Leon Li-Yang
    Cheng, Ken Chau-Cheung
    Liao, Peter Yi-Yu
    Wang, Sying-Jyan
    Huang, Andrew Yi-An
    Tsai, Nova
    Chou, Leon
    Han, Gus Chang-Hung
    Chen, Jwu E.
    Liang, Hsing-Chung
    Hsu, Chun-Lung
    2021 IEEE EUROPEAN TEST SYMPOSIUM (ETS 2021), 2021,