Hierarchical variable clustering via copula-based divergence measures between random vectors

Cited by: 8
Authors
De Keyser, Steven [1]
Gijbels, Irene [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
Keywords
Copulas; phi-dependence; Random vectors; Trans-elliptical distributions; Variable clustering; Kernel estimation
DOI
10.1016/j.ijar.2023.109090
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article considers rank-invariant clustering of continuous data via copula-based phi-dependence measures. The general theoretical framework establishes dependence quantification between random vectors (groups of variables), which is used for measuring the similarity between variable clusters in an agglomerative hierarchical procedure afterwards. Special attention is devoted to meta-elliptical copulas, where we present an improved kernel estimator for the density generator and a corresponding bandwidth selector. This allows for non-Gaussian similarities also capturing e.g. tail dependence. Further, a fully non-parametric estimator is considered, enabling cluster detection in contexts where other measures fail. The theory is supported by simulations and a real data example, focusing on cluster analysis of continuous variables.
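The agglomerative procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' estimator: it assumes the simplest meta-elliptical case, a Gaussian copula, for which the phi-dependence with phi(t) = t log t (mutual information) between two groups of variables has a closed form in the correlation matrix of the normal scores. The function names `normal_scores`, `gaussian_copula_mi`, and `cluster_variables` are hypothetical, and the rule of merging the pair of clusters with the largest unnormalized similarity is an assumption made for illustration.

```python
import numpy as np
from scipy.stats import norm, rankdata


def normal_scores(x):
    """Replace each column by normal scores of its ranks (the rank-invariant step)."""
    n = x.shape[0]
    u = rankdata(x, axis=0) / (n + 1.0)  # pseudo-observations strictly inside (0, 1)
    return norm.ppf(u)


def gaussian_copula_mi(r, a, b):
    """Mutual information between variable groups a and b under a Gaussian copula.

    r is the correlation matrix of the normal scores; a and b are lists of
    column indices. Closed form: 0.5 * log(det(R_aa) * det(R_bb) / det(R_ab)).
    """
    ab = a + b
    _, logdet_ab = np.linalg.slogdet(r[np.ix_(ab, ab)])
    _, logdet_a = np.linalg.slogdet(r[np.ix_(a, a)])
    _, logdet_b = np.linalg.slogdet(r[np.ix_(b, b)])
    return 0.5 * (logdet_a + logdet_b - logdet_ab)


def cluster_variables(x, n_clusters):
    """Agglomerative variable clustering: repeatedly merge the most dependent pair.

    Assumption for illustration: the similarity is the raw (unnormalized)
    mutual information between clusters.
    """
    r = np.corrcoef(normal_scores(x), rowvar=False)
    clusters = [[j] for j in range(x.shape[1])]  # start from singleton clusters
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                s = gaussian_copula_mi(r, clusters[i], clusters[j])
                if best is None or s > best[0]:
                    best = (s, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge cluster j into cluster i
        del clusters[j]
    return clusters
```

Because the similarity depends on the data only through ranks, applying a strictly increasing transformation to any column leaves the resulting clusters unchanged. A Gaussian copula cannot capture tail dependence, which is one motivation the abstract gives for the more general meta-elliptical and non-parametric estimators.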
Pages: 22
Related papers
50 in total
  • [1] Parametric dependence between random vectors via copula-based divergence measures
    De Keyser, Steven
    Gijbels, Irene
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 203
  • [2] Copula-Based Divergence Measures for Dependence Between Random Vectors
    De Keyser, Steven
    Gijbels, Irene
    BUILDING BRIDGES BETWEEN SOFT AND STATISTICAL METHODOLOGIES FOR DATA SCIENCE, 2023, 1433 : 104 - 111
  • [3] Hierarchical variable clustering based on the predictive strength between random vectors
    Fuchs, Sebastian
    Wang, Yuping
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 170
  • [4] Copula-based dependence measures
    Liebscher, Eckhard
    DEPENDENCE MODELING, 2014, 2 (01): : 49 - 64
  • [5] Hierarchical copula-based distributed detection
    Javadi, S. Hamed
    Mohammadi, Abdolreza
    Farina, Alfonso
    SIGNAL PROCESSING, 2019, 158 : 100 - 106
  • [6] Hierarchical Variable Clustering Based on Measures of Predictability
    Wang, Yuping
    Fuchs, Sebastian
    COMBINING, MODELLING AND ANALYZING IMPRECISION, RANDOMNESS AND DEPENDENCE, SMPS 2024, 2024, 1458 : 548 - 553
  • [7] A copula-based hierarchical hybrid loss distribution
    Bernardi, Enrico
    Romagnoli, Silvia
    STATISTICS & RISK MODELING, 2015, 32 (01) : 73 - 87
  • [8] Copula-based measures of asymmetry between the lower and upper tail probabilities
    Kato, Shogo
    Yoshiba, Toshinao
    Eguchi, Shinto
    STATISTICAL PAPERS, 2022, 63 (06) : 1907 - 1929
  • [9] Copula-based measures and tests for conditional asymmetry
    Mokhtari, E.
    Dolati, A.
    Dastbaravarde, A.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025, 54 (01) : 1 - 22
  • [10] Distorted Copula-Based Probability Distribution of a Counting Hierarchical Variable: A Credit Risk Application
    Bernardi, Enrico
    Romagnoli, Silvia
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2016, 15 (02) : 285 - 310