Automatic aspect discrimination in data clustering

Cited by: 10
Authors
Horta, Danilo [1]
Campello, Ricardo J. G. B. [1]
Affiliations
[1] Univ Sao Paulo, ICMC, BR-13560970 Sao Carlos, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil
Keywords
Clustering; Aspect discrimination; Attribute weighting; Cluster validation; FUZZY EXTENSION; RELATIONAL DATA; VALIDITY; AGGREGATION; VALIDATION; ALGORITHMS; COMPLEXITY; CRITERION; INDEXES;
DOI
10.1016/j.patcog.2012.05.011
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspect weighting was proposed in the literature. However, SCAD may fail and halt under certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters that the user must set. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity that is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets. © 2012 Elsevier Ltd. All rights reserved.
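For orientation, the baseline that the abstract compares against can be sketched as follows. This is a minimal sketch of standard fuzzy c-means only, not of SCAD or ASCAD, whose update rules are not given in this record. The function name `fuzzy_c_means`, its parameters (`m` as the fuzzifier, `iters`, `seed`), and the toy data are assumptions made for illustration. Each iteration visits every point, cluster, and attribute once per update, which gives the per-iteration cost proportional to n * c * d that the complexity claim refers to.

```python
import random


def fuzzy_c_means(points, c, m=2.0, iters=100, seed=0):
    """Standard fuzzy c-means (illustrative; not ASCAD).

    points: list of equal-length numeric tuples; c: number of clusters;
    m: fuzzifier (> 1). Returns (centers, memberships).
    """
    rng = random.Random(seed)
    n, d = len(points), len(points[0])
    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = [[0.0] * d for _ in range(c)]
    for _ in range(iters):
        # Center update: mean of the points weighted by u^m.
        for k in range(c):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            if total > 0.0:  # keep the old center if the cluster is empty
                centers[k] = [
                    sum(w[i] * points[i][j] for i in range(n)) / total
                    for j in range(d)
                ]
        # Membership update from squared Euclidean distances.
        for i in range(n):
            dist = [
                sum((points[i][j] - centers[k][j]) ** 2 for j in range(d))
                for k in range(c)
            ]
            hits = {k for k in range(c) if dist[k] == 0.0}
            if hits:
                # Point coincides with a center: assign it crisply.
                u[i] = [1.0 / len(hits) if k in hits else 0.0 for k in range(c)]
            else:
                inv = [dk ** (-1.0 / (m - 1.0)) for dk in dist]
                total = sum(inv)
                u[i] = [v / total for v in inv]
    return centers, u


# Toy data (hypothetical): two well-separated groups of three points each.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers, memberships = fuzzy_c_means(data, c=2)
labels = [row.index(max(row)) for row in memberships]
```

An aspect-weighting method in the spirit of the abstract would additionally learn weights over attribute subsets inside the distance computation; that part is deliberately left out here because the record does not specify it.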
Pages: 4370-4388
Page count: 19
Related papers
50 in total
  • [21] A generalized automatic clustering algorithm in a multiobjective framework
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    APPLIED SOFT COMPUTING, 2013, 13 (01) : 89 - 108
  • [22] On clustering shape data
    Nabil, M.
    Golalizadeh, M.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (15) : 2995 - 3008
  • [23] Relational data clustering with incomplete data
    Hathaway, RJ
    Overstreet, DD
    Murphy, TE
    Bezdek, JC
    APPLICATIONS AND SCIENCE OF COMPUTATIONAL INTELLIGENCE IV, 2001, 4390 : 273 - 280
  • [24] Aspect Clustering Methods for Sentiment Analysis
    Vargas, Francielle Alves
    Salgueiro Pardo, Thiago Alexandre
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 365 - 374
  • [25] AUTOMATIC CLUSTERING OF MULTISPECTRAL DATA USING A NON-GAUSSIAN STATISTICAL MODEL
    Khan, Salman
    Doulgeris, Anthony P.
    Savastano, Salvatore
    Guida, Raffaella
    2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014,
  • [26] Dynamic Data Driven-based Automatic Clustering and Semantic Annotation for Internet of Things Sensor Data
    Lin, Szu-Yin
    Li, Jun-Bin
    Yu, Ching-Tzu
    SENSORS AND MATERIALS, 2019, 31 (06) : 1789 - 1801
  • [27] XML Data Clustering: An Overview
    Algergawy, Alsayed
    Mesiti, Marco
    Nayak, Richi
    Saake, Gunter
    ACM COMPUTING SURVEYS, 2011, 43 (04)
  • [28] Functional data clustering: a survey
    Jacques, Julien
    Preda, Cristian
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2014, 8 (03) : 231 - 255
  • [29] Data clustering: application and trends
    Oyewole, Gbeminiyi John
    Thopil, George Alex
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (07) : 6439 - 6475
  • [30] Robust clustering of imprecise data
    D'Urso, Pierpaolo
    De Giovanni, Livia
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 136 : 58 - 80