Quantile-based clustering

被引:5
|
作者
Hennig, Christian [1 ]
Viroli, Cinzia [1 ]
Anderlucci, Laura [1 ]
机构
[1] Univ Bologna, Dept Stat Sci, Via Belle Arti 41, I-40126 Bologna, Italy
来源
ELECTRONIC JOURNAL OF STATISTICS | 2019年 / 13卷 / 02期
关键词
Fixed partition model; quantile discrepancy; high dimensional clustering; nonparametric mixture; CLASSIFICATION; CONSISTENCY;
D O I
10.1214/19-EJS1640
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A new cluster analysis method, K-quantiles clustering, is introduced. K-quantiles clustering can be computed by a simple greedy algorithm in the style of the classical Lloyd's algorithm for K-means. It can be applied to large and high-dimensional datasets. It allows for within-cluster skewness and internal variable scaling based on within-cluster variation. Different versions allow for different levels of parsimony and computational efficiency. Although K-quantiles clustering is conceived as nonparametric, it can be connected to a fixed partition model of generalized asymmetric Laplace-distributions. The consistency of K-quantiles clustering is proved, and it is shown that K-quantiles clusters correspond to well separated mixture components in a nonparametric mixture. In a simulation, K-quantiles clustering is compared with a number of popular clustering methods with good results. A high-dimensional microarray dataset is clustered by K-quantiles.
引用
收藏
页码:4849 / 4883
页数:35
相关论文
共 50 条
  • [1] A Nonparametric Clustering Algorithm with a Quantile-Based Likelihood Estimator
    Hino, Hideitsu
    Murata, Noboru
    NEURAL COMPUTATION, 2014, 26 (09) : 2074 - 2101
  • [2] Quantile-based classifiers
    Hennig, C.
    Viroli, C.
    BIOMETRIKA, 2016, 103 (02) : 435 - 446
  • [3] Quantile-based fuzzy clustering of multivariate time series in the frequency domain
    Lopez-Oriona, Angel
    Vilar, Jose A.
    D'Urso, Pierpaolo
    FUZZY SETS AND SYSTEMS, 2022, 443 : 115 - 154
  • [4] Quantile-based fuzzy clustering of multivariate time series in the frequency domain
    Lopez-Oriona, Angel
    Vilar, Jose A.
    D'Urso, Pierpaolo
    FUZZY SETS AND SYSTEMS, 2022, 443 : 115 - 154
  • [5] Quantile-based cumulative entropies
    Sankaran, P. G.
    Sunoj, S. M.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (02) : 805 - 814
  • [6] Quantile-Based Reliability Analysis
    Nair, N. Unnikrishnan
    Sankaran, P. G.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (02) : 222 - 232
  • [7] Quantile-Based Hydrological Modelling
    Tyralis, Hristos
    Papacharalampous, Georgia
    WATER, 2021, 13 (23)
  • [8] Quantile-Based Risk Sharing
    Embrechts, Paul
    Liu, Haiyan
    Wang, Ruodu
    OPERATIONS RESEARCH, 2018, 66 (04) : 936 - 949
  • [9] Composite quantile-based classifiers
    Pritchard, David A.
    Liu, Yufeng
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (04) : 337 - 353
  • [10] Quantile-based overlap measures
    Mathew, Angel
    Joseph, Chinu
    RICERCHE DI MATEMATICA, 2024, 73 (04) : 1919 - 1936