Hierarchical variable clustering via copula-based divergence measures between random vectors

Times cited: 8
Authors
De Keyser, Steven [1]
Gijbels, Irene [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
Keywords
Copulas; phi-dependence; Random vectors; Trans-elliptical distributions; Variable clustering; Kernel estimation
DOI
10.1016/j.ijar.2023.109090
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article considers rank-invariant clustering of continuous data via copula-based phi-dependence measures. The general theoretical framework establishes dependence quantification between random vectors (groups of variables), which is used for measuring the similarity between variable clusters in an agglomerative hierarchical procedure afterwards. Special attention is devoted to meta-elliptical copulas, where we present an improved kernel estimator for the density generator and a corresponding bandwidth selector. This allows for non-Gaussian similarities also capturing e.g. tail dependence. Further, a fully non-parametric estimator is considered, enabling cluster detection in contexts where other measures fail. The theory is supported by simulations and a real data example, focusing on cluster analysis of continuous variables.
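The agglomerative procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Gaussian copula (rather than the meta-elliptical or fully non-parametric estimators the abstract mentions) and uses mutual information, one member of the phi-dependence family, as the similarity between groups of variables. All function names and the toy data generator `demo_columns` are hypothetical and introduced only for illustration.

```python
import math
import random
from statistics import NormalDist

def normal_scores(column):
    """Replace values by normal quantiles of their ranks (rank-invariant step)."""
    n = len(column)
    order = sorted(range(n), key=lambda i: column[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = NormalDist().inv_cdf(rank / (n + 1))
    return scores

def correlation_matrix(columns):
    """Pearson correlation matrix of equally long columns."""
    n = len(columns[0])
    centered = [[v - sum(c) / n for v in c] for c in columns]
    norms = [math.sqrt(sum(v * v for v in c)) for c in centered]
    d = len(columns)
    return [[sum(a * b for a, b in zip(centered[i], centered[j]))
             / (norms[i] * norms[j]) for j in range(d)] for i in range(d)]

def log_det(matrix, idx):
    """Log-determinant of the submatrix indexed by idx (assumed positive definite)."""
    a = [[matrix[i][j] for j in idx] for i in idx]
    total = 0.0
    for k in range(len(a)):
        pivot = a[k][k]
        total += math.log(pivot)
        for r in range(k + 1, len(a)):
            factor = a[r][k] / pivot
            for c in range(k, len(a)):
                a[r][c] -= factor * a[k][c]
    return total

def gaussian_mi(corr, group_a, group_b):
    """Mutual information between two variable groups under a Gaussian copula."""
    joint = group_a + group_b
    return 0.5 * (log_det(corr, group_a) + log_det(corr, group_b)
                  - log_det(corr, joint))

def cluster_variables(columns, n_clusters):
    """Merge the two most dependent clusters until n_clusters remain."""
    corr = correlation_matrix([normal_scores(c) for c in columns])
    clusters = [[i] for i in range(len(columns))]
    while len(clusters) > n_clusters:
        pairs = [(a, b) for a in range(len(clusters))
                 for b in range(a + 1, len(clusters))]
        a, b = max(pairs, key=lambda p: gaussian_mi(corr, clusters[p[0]],
                                                    clusters[p[1]]))
        merged = sorted(clusters[a] + clusters[b])
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return sorted(clusters)

def demo_columns(seed=0, n=300):
    """Hypothetical toy data: variables 0,1 share one factor, 2,3 another."""
    rng = random.Random(seed)
    cols = [[], [], [], []]
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        cols[0].append(z1 + 0.3 * rng.gauss(0, 1))
        cols[1].append(math.exp(z1 + 0.3 * rng.gauss(0, 1)))  # monotone transform
        cols[2].append(z2 + 0.3 * rng.gauss(0, 1))
        cols[3].append((z2 + 0.3 * rng.gauss(0, 1)) ** 3)     # monotone transform
    return cols
```

Calling `cluster_variables(demo_columns(), 2)` groups the four toy variables into two clusters. Because the similarity is computed from ranks only, the monotone transforms applied to variables 1 and 3 do not affect the result; capturing tail dependence, as the abstract describes, would require replacing the Gaussian copula assumption.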
Pages: 22