Identifying cluster number for subspace projected functional data clustering

被引:23
作者
Li, Pai-Ling [1 ]
Chiou, Jeng-Min [2 ]
机构
[1] Tamkang Univ, New Taipei City 25137, Taiwan
[2] Acad Sinica, Taipei 11529, Taiwan
关键词
Bootstrapping; Cluster analysis; Functional data analysis; Functional principal components; Gene expression profiles; Hypothesis test; GENE-EXPRESSION; DATA SET; CLASSIFICATION; VALIDATION; ALGORITHM; MODEL;
D O I
10.1016/j.csda.2011.01.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a new approach, the forward functional testing (FFT) procedure, to cluster number selection for functional data clustering. We present a framework of subspace projected functional data clustering based on the functional multiplicative random-effects model, and propose to perform functional hypothesis tests on equivalence of cluster structures to identify the number of clusters. The aim is to find the maximum number of distinctive clusters while retaining significant differences between cluster structures. The null hypotheses comprise equalities between the cluster mean functions and between the sets of cluster eigenfunctions of the covariance kernels. Bootstrap resampling methods are developed to construct reference distributions of the derived test statistics. We compare several other cluster number selection criteria, extended from methods of multivariate data, with the proposed FFT procedure. The performance of the proposed approaches is examined by simulation studies, with applications to clustering gene expression profiles. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2090 / 2103
页数:14
相关论文
共 37 条
  • [1] Unsupervised curve clustering using B-splines
    Abraham, C
    Cornillon, PA
    Matzner-Lober, E
    Molinari, N
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (03) : 581 - 595
  • [2] [Anonymous], 1993, An introduction to the bootstrap
  • [3] Gene expression during the life cycle of Drosophila melanogaster
    Arbeitman, MN
    Furlong, EEM
    Imam, F
    Johnson, E
    Null, BH
    Baker, BS
    Krasnow, MA
    Scott, MP
    Davis, RW
    White, KP
    [J]. SCIENCE, 2002, 297 (5590) : 2270 - 2275
  • [4] Calinski T., 1974, Communications in Statistics-theory and Methods, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
  • [5] Functional clustering and identifying substructures of longitudinal data
    Chiou, Jeng-Min
    Li, Pai-Ling
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 : 679 - 699
  • [6] Correlation-Based Functional Clustering via Subspace Projection
    Chiou, Jeng-Min
    Li, Pai-Ling
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1684 - 1692
  • [7] Robust estimation and classification for functional data via projection-based depth notions
    Cuevas, Antonio
    Febrero, Manuel
    Fraiman, Ricardo
    [J]. COMPUTATIONAL STATISTICS, 2007, 22 (03) : 481 - 496
  • [8] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [9] Ferraty F., 2006, SPR S STAT
  • [10] How many clusters? Which clustering method? Answers via model-based cluster analysis
    Fraley, C
    Raftery, AE
    [J]. COMPUTER JOURNAL, 1998, 41 (08) : 578 - 588