Hierarchical variable clustering via copula-based divergence measures between random vectors

被引:8
|
作者
De Keyser, Steven [1 ]
Gijbels, Irene [1 ]
机构
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
关键词
Copulas; phi-dependence; Random vectors; Trans-elliptical distributions; Variable clustering; KERNEL ESTIMATION;
D O I
10.1016/j.ijar.2023.109090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article considers rank-invariant clustering of continuous data via copula-based phi-dependence measures. The general theoretical framework establishes dependence quantification between random vectors (groups of variables), which is used for measuring the similarity between variable clusters in an agglomerative hierarchical procedure afterwards. Special attention is devoted to meta-elliptical copulas, where we present an improved kernel estimator for the density generator and a corresponding bandwidth selector. This allows for non-Gaussian similarities also capturing e.g. tail dependence. Further, a fully non-parametric estimator is considered, enabling cluster detection in contexts where other measures fail. The theory is supported by simulations and a real data example, focusing on cluster analysis of continuous variables.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A new family of copula-based concordance orderings of random pairs: Properties and nonparametric tests
    Quessy, Jean-Francois
    Mesfioui, Mhamed
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01): : 2393 - 2429
  • [42] A clusterized copula-based probability distribution of a counting variable for high-dimensional problems
    Bernardi, Enrico
    Romagnoli, Silvia
    JOURNAL OF CREDIT RISK, 2013, 9 (02): : 3 - 26
  • [43] Copula-Based Correlation Analysis of Intensity Measures of Mainshock-Aftershock Ground Motions
    Zhu R.-G.
    Lü D.-G.
    Gongcheng Lixue/Engineering Mechanics, 2019, 36 (02): : 114 - 123
  • [44] A Bayesian inference for time series via copula-based Markov chain models
    Sun, Li-Hsien
    Lee, Chang-Shang
    Emura, Takeshi
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2020, 49 (11) : 2897 - 2913
  • [45] Computing the Semantic Similarity between Documents by the Copula-Based Econometric Models
    Huang, Jih-Jeng
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019, : 134 - 139
  • [46] LP-moments of random vectors via majorizing measures
    Guedon, Olivier
    Rudelson, Mark
    ADVANCES IN MATHEMATICS, 2007, 208 (02) : 798 - 823
  • [47] Improvement of Hierarchical Clustering Results by Refinement of Variable Types and Distance Measures
    Curic, Sofija Pinjusic
    Vranic, Mihaela
    Pintar, Damir
    AUTOMATIKA, 2011, 52 (04) : 353 - 364
  • [48] A numerical strategy to evaluate performance of predictive scores via a copula-based approach
    Zhang Yilong
    Shao Yongzhao
    STATISTICS IN MEDICINE, 2020, 39 (20) : 2671 - 2684
  • [49] Copula-Based Reliability Analysis of Vehicles Based on Censored Failures Data Using Reliability Importance Measures
    Oszczypala, Mateusz
    Konwerski, Jakub
    Ziolkowski, Jaroslaw
    Malachowski, Jerzy
    IEEE ACCESS, 2024, 12 : 154119 - 154137
  • [50] Agglomerative hierarchical co-clustering based on Bregman divergence
    Shen, Guowei, 1600, Springer Verlag (287):