Impact of Dataset Scaling on Hierarchical Clustering: A Comparative Analysis of Distance-Based and Ratio-Based Methods

Author
Alzahrani, Ali Rashash R. [1]
Affiliation
[1] Umm Al Qura Univ, Fac Sci, Math Dept, Mecca, Saudi Arabia
Source
INTERNATIONAL JOURNAL OF ANALYSIS AND APPLICATIONS | 2024 / Vol. 22
Keywords
distance type methods; ratio type method; median linkage; centroid linkage; average linkage;
DOI
10.28924/2291-8639-22-2024-36
Chinese Library Classification
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
In this study, distance-based agglomerative hierarchical clustering techniques were compared to a ratio-based approach. Two real datasets, which were also used in a prior study by Roux (2018), were considered. First, the type of scaling applied to the datasets was found to affect the results of hierarchical clustering. Thus, various scaling methods were employed prior to implementing hierarchical clustering. Furthermore, two rank-based goodness-of-fit measures were used to evaluate the hierarchical clustering methods. In contrast to the findings of Roux (2018), the distance-based methods, such as median linkage, average linkage, and centroid linkage, performed better than the ratio-based method. The ratio-based method also showed issues with branch crossing in the hierarchical clustering dendrogram. Consequently, this study illustrates that, with appropriate dataset scaling, the distance-based methods outperform ratio-based methods in terms of goodness-of-fit measures.
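The workflow summarized in the abstract (scale the data, run a distance-based linkage, score the result with a rank-based fit measure) can be sketched roughly as follows. This is an illustrative sketch only, not the paper's procedure: the abstract does not name the scaling methods or the two goodness-of-fit measures, so z-score and min-max scaling and a Spearman rank correlation between the input distances and the cophenetic distances are assumptions chosen here. The data are synthetic, the helper names (`zscore`, `minmax`, `rank_fit`) are hypothetical, and the ratio-based method is not implemented because SciPy does not provide one.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr


def zscore(X):
    # Column-wise standardization (assumed scaling choice, not from the paper).
    return (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)


def minmax(X):
    # Column-wise rescaling to [0, 1] (assumed scaling choice).
    lo = X.min(axis=0)
    return (X - lo) / (X.max(axis=0) - lo)


def rank_fit(X, method):
    # Spearman correlation between the Euclidean input distances and the
    # cophenetic distances of the tree: one possible rank-based fit measure.
    d = pdist(X, metric="euclidean")
    Z = linkage(X, method=method)  # Euclidean metric by default
    rho, _ = spearmanr(d, cophenet(Z))
    return rho


# Synthetic data with columns on very different scales (not the paper's data).
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 4)) * np.array([1.0, 10.0, 100.0, 1000.0])

scalers = {"raw": lambda A: A, "zscore": zscore, "minmax": minmax}
results = {}
for sname, scale in scalers.items():
    Xs = scale(X)
    for method in ("average", "centroid", "median"):
        results[(sname, method)] = rank_fit(Xs, method)

for key, rho in sorted(results.items()):
    print(key, round(float(rho), 3))
```

On the raw data the largest-scale column dominates the Euclidean distances, so comparing the `raw` rows with the scaled rows gives a rough sense of how much the choice of scaling can change the fitted tree. The numbers from this sketch say nothing about the paper's reported results.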
Pages: 14