Symbolic clustering of large datasets

被引:2
作者
Lechevallier, Yves
Verde, Rosanna [1 ]
de Carvalho, Francisco de A. T. [2 ]
机构
[1] Seconda Univ Napoli, Dip Strateg Aziendali & Metod Quantit, I-81043 Capua, CE, Italy
[2] Cidade Univ, Ctr Informat, BR-50740540 Recife, PE, Brazil
来源
DATA SCIENCE AND CLASSIFICATION | 2006年
关键词
D O I
10.1007/3-540-34416-0_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach to cluster large datasets that integrates the Kohonen Self Organizing Maps (SOM) with a dynamic clustering algorithm of symbolic data (SCLUST). A preliminary data reduction using SOM algorithm is performed. As a result, the individual measurements are replaced by micro-clusters. These micro-clusters are then grouped in a few clusters which are modeled by symbolic objects. By computing the extension of these symbolic objects, symbolic clustering algorithm allows discovering the natural classes. An application on a real data set shows the usefulness of this methodology.
引用
收藏
页码:193 / +
页数:3
相关论文
共 50 条
  • [21] POFCM: A Parallel Fuzzy Clustering Algorithm for Large Datasets
    Perez-Ortega, Joaquin
    Rey-Figueroa, Cesar David
    Roblero-Aguilar, Sandra Silvia
    Almanza-Ortega, Nelva Nely
    Zavala-Diaz, Crispin
    Garcia-Paredes, Salomon
    Landero-Najera, Vanesa
    MATHEMATICS, 2023, 11 (08)
  • [22] Effective data summarization for hierarchical clustering in large datasets
    Patra, Bidyut Kr.
    Nandi, Sukumar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 42 (01) : 1 - 20
  • [23] Effective data summarization for hierarchical clustering in large datasets
    Bidyut Kr. Patra
    Sukumar Nandi
    Knowledge and Information Systems, 2015, 42 : 1 - 20
  • [24] NBC: An Efficient Hierarchical Clustering Algorithm for Large Datasets
    Zhang, Wei
    Zhang, Gongxuan
    Wang, Yongli
    Zhu, Zhaomeng
    Li, Tao
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2015, 9 (03) : 307 - 331
  • [25] Efficient Hierarchical Clustering of Large High Dimensional Datasets
    Gilpin, Sean
    Qian, Buyue
    Davidson, Ian
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1371 - 1380
  • [26] A Framework for Data Clustering of Large Datasets in a Distributed Environment
    Swapna, Ch. Swetha
    Kumar, V. Vijaya
    Murthy, J. V. R.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 425 - 441
  • [27] Fast adaptive clustering by synchronization on large scale datasets
    Ying, Wenhao
    Xu, Min
    Wang, Shitong
    Deng, Zhaohong
    Ying, W. (cslgywh@163.com), 1600, Science Press (51): : 707 - 720
  • [28] DHC: A Distributed Hierarchical Clustering Algorithm for Large Datasets
    Zhang, Wei
    Zhang, Gongxuan
    Chen, Xiaohui
    Liu, Yueqi
    Zhou, Xiumin
    Zhou, Junlong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (04)
  • [29] Spectral Clustering Trough Topological Learning for Large Datasets
    Rogovschi, Nicoleta
    Grozavu, Nistor
    Labiod, Lazhar
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 216 - 223
  • [30] Clustering Large Datasets by MergingK-Means Solutions
    Melnykov, Volodymyr
    Michael, Semhar
    JOURNAL OF CLASSIFICATION, 2020, 37 (01) : 97 - 123