SIMILARITY MEASURES FOR NOMINAL VARIABLE CLUSTERING

被引:0
|
作者
Sulc, Zdenek [1 ]
机构
[1] Univ Econ, Dept Stat & Probabil, Prague 13067 3, Czech Republic
来源
8TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS | 2014年
关键词
nominal variables; variable clustering; similarity measures;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
The paper deals with selected similarity measures which can be used for hierarchical clustering of nominal variables. These variables are commonly used in questionnaire surveys. Cluster analysis can be applied in case a reduction of a dataset size is welcomed. In this paper, there are examined several similarity measures for nominal variable clustering, which have been introduced in recent years. On the contrary to the simple matching coefficient, which is considered to be a basic similarity measure, they take into account more characteristics regarding the dataset, such as distribution of frequencies of categories. Therefore, they should provide better results in a comparison to the simple matching coefficient. The performance of clustering with selected similarity measures is examined on two real datasets. For cluster quality evaluation, indices based on the within-cluster variability have been chosen. All computations have been performed in the statistical systems Matlab, IBM SPSS Statistics and MS Excel.
引用
收藏
页码:1536 / 1545
页数:10
相关论文
共 50 条
  • [1] NOMINAL VARIABLE CLUSTERING AND ITS EVALUATION
    Rezankova, Hana
    8TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2014, : 1293 - 1302
  • [2] Improving patient clustering by incorporating structured variable label relationships in similarity measures
    Lambert, Judith
    Leutenegger, Anne-Louise
    Baudot, Anais
    Jannot, Anne-Sophie
    BMC MEDICAL RESEARCH METHODOLOGY, 2025, 25 (01)
  • [3] Similarity Measures for Spatial Clustering
    Hamdad, Leila
    Benatchba, Karima
    Ifrez, Soraya
    Mohguen, Yasmine
    COMPUTATIONAL INTELLIGENCE AND ITS APPLICATIONS, 2018, 522 : 25 - 36
  • [4] Clustering of documents via similarity measures
    Rezanková, H
    Húsek, D
    Smid, J
    Snásel, V
    CIC'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN COMPUTING, 2003, : 292 - 299
  • [5] Improved Similarity Measures For Software Clustering
    Naseem, Rashid
    Maqbool, Onaiza
    Muhammad, Siraj
    2011 15TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2011, : 45 - 54
  • [6] Comparison of similarity measures for clustering Turkish documents
    Madylova, Ainura
    Oguducu, Sule Guenduez
    INTELLIGENT DATA ANALYSIS, 2009, 13 (05) : 815 - 832
  • [7] Similarity Measures Recommendation for Mixed Data Clustering
    Diop, Abdoulaye
    El Malki, Nabil
    Chevalier, Max
    Peninou, Andre
    Teste, Olivier
    Jimenez, Geoffrey Roman
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT 36TH INTERNATIONAL CONFERENCE, SSDBM 2024, 2024,
  • [8] EVALUATION OF POLSAR SIMILARITY MEASURES WITH SPECTRAL CLUSTERING
    Hu, Jingliang
    Wang, Yuanyuan
    Ghamisi, Pedram
    Zhu, Xiao Xiang
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 3254 - 3257
  • [9] Comparison of Similarity Measures in Context of Rules Clustering
    Nowak-Brzezinska, Agnieszka
    Rybotycki, Tomasz
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 235 - 240
  • [10] Comparison of similarity measures for clustering electrocardiogram complexes
    Chang, KC
    Lee, RG
    Wen, C
    Yeh, MF
    COMPUTERS IN CARDIOLOGY 2005, VOL 32, 2005, 32 : 759 - 762