Hybrid hierarchical clustering with applications to microarray data

被引:95
作者
Chipman, H [1 ]
Tibshirani, R
机构
[1] Acadia Univ, Dept Math & Stat, Wolfville, NS B4P 2R6, Canada
[2] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
bottom-up clustering; mutual cluster; top-down clustering;
D O I
10.1093/biostatistics/kxj007
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose a hybrid clustering method that combines the strengths of bottom-up hierarchical clustering with that of top-down clustering. The first method is good at identifying small clusters but not large ones; the strengths are reversed for the second method. The hybrid method is built on the new idea of a mutual cluster: a group of points closer to each other than to any other points. Theoretical connections between mutual clusters and bottom-up clustering methods are established, aiding in their interpretation and providing an algorithm for identification of mutual clusters. We illustrate the technique on simulated and real microarray datasets.
引用
收藏
页码:286 / 301
页数:16
相关论文
共 12 条
  • [11] DISSIMILARITY ANALYSIS - NEW TECHNIQUE OF HIERARCHICAL SUB-DIVISION
    MACNAUGHTONSMIT.P
    MOCKETT, LG
    DALE, MB
    WILLIAMS, WT
    [J]. NATURE, 1964, 202 (493) : 1034 - &
  • [12] Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications
    Sorlie, T
    Perou, CM
    Tibshirani, R
    Aas, T
    Geisler, S
    Johnsen, H
    Hastie, T
    Eisen, MB
    van de Rijn, M
    Jeffrey, SS
    Thorsen, T
    Quist, H
    Matese, JC
    Brown, PO
    Botstein, D
    Lonning, PE
    Borresen-Dale, AL
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) : 10869 - 10874