Agglomerative hierarchical clustering of continuous variables based on mutual information

被引:37
作者
Kojadinovic, I [1 ]
机构
[1] Univ La Reunion, IREMIA, 15 Ave Rene Cassin BP 7151, F-97715 St Denis Messageries 9, Reunion, France
关键词
agglomerative hierarchical clustering; continuous variables; mutual information; Shannon entropy; redundancy; adaptive kernel density estimation;
D O I
10.1016/S0167-9473(03)00153-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In order to study interdependencies among continuous variables in the framework of a data analysis problem, an agglomerative hierarchical clustering of the set of variables is performed. The similarity measure used within the clustering algorithm is based on the notion of mutual information. Recent results on the estimation of this measure of stochastic dependence are presented and the behavior of the clustering algorithm is studied on several artificial problems, i.e., which "structure" is known. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:269 / 294
页数:26
相关论文
共 26 条
  • [1] [Anonymous], 1981, ANAL TYPOLOGIQUE THE
  • [2] Beirlant J, 1997, INT J MATH STAT SCI, V6, P17
  • [3] Bellman R, 1961, ADAPTIVE CONTROL PRO, DOI DOI 10.1515/9781400874668
  • [4] BENZECRI JP, 1976, ANAL DONNES TAXONOMI
  • [5] Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
  • [6] UNCERTAINTY, INFORMATION, AND SEQUENTIAL EXPERIMENTS
    DEGROOT, MH
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (02): : 404 - &
  • [7] Flannery B.P., 1992, NUMERICAL RECIPES C
  • [8] Fukunaga K., 1990, INTRO STAT PATTERN R
  • [9] Gordon A, 1999, Classification
  • [10] ON THE ESTIMATION OF ENTROPY
    HALL, P
    MORTON, SC
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1993, 45 (01) : 69 - 88