AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology

被引:34
作者
Achcar, Fiona [1 ,2 ,3 ]
Camadro, Jean-Michel [2 ,3 ]
Mestivier, Denis [1 ]
机构
[1] CNRS, Modeling Integrat Biol Grp, F-75205 Paris 13, France
[2] CNRS, Inst Jacques Monod, Prot Engn & Metab Control Grp, UMR7592, F-75205 Paris 13, France
[3] Univ Paris Diderot, F-75205 Paris 13, France
关键词
GENE-EXPRESSION DATA; CLUSTERING MICROARRAY DATA; MIXTURE MODEL; SACCHAROMYCES-CEREVISIAE; KNOWLEDGE; PROFILES; PROTEINS; PATTERNS; SEQUENCE;
D O I
10.1093/nar/gkp430
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recently, several theoretical and applied studies have shown that unsupervised Bayesian classification systems are of particular relevance for biological studies. However, these systems have not yet fully reached the biological community mainly because there are few freely available dedicated computer programs, and Bayesian clustering algorithms are known to be time consuming, which limits their usefulness when using personal computers. To overcome these limitations, we developed AutoClass@IJM, a computational resource with a web interface to AutoClass, a powerful unsupervised Bayesian classification system developed by the Ames Research Center at N.A.S.A. AutoClass has many powerful features with broad applications in biological sciences: (i) it determines the number of classes automatically, (ii) it allows the user to mix discrete and real valued data, (iii) it handles missing values. End users upload their data sets through our web interface; computations are then queued in our cluster server. When the clustering is completed, an URL to the results is sent back to the user by e-mail. AutoClass@IJM is freely available at: http://ytat2.ijm.univ-paris-diderot.fr/AutoclassAtIJM.html.
引用
收藏
页码:W63 / W67
页数:5
相关论文
共 31 条
  • [1] MEME: discovering and analyzing DNA and protein sequence motifs
    Bailey, Timothy L.
    Williams, Nadya
    Misleh, Chris
    Li, Wilfred W.
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : W369 - W373
  • [2] Bayesian hierarchical spatially correlated functional data analysis with application to colon carcinogenesis
    Baladandayuthapani, Veerabhadran
    Mallick, Bani K.
    Hong, Mee Young
    Lupton, Joanne R.
    Turner, Nancy D.
    Carroll, Raymond J.
    [J]. BIOMETRICS, 2008, 64 (01) : 64 - 73
  • [3] Predicting gene expression from sequence
    Beer, MA
    Tavazoie, S
    [J]. CELL, 2004, 117 (02) : 185 - 198
  • [4] Cheeseman P., 1996, Advances in knowledge discovery and data mining
  • [5] CHEESEMAN P, 1988, P 5 INT C MACH LEARN
  • [6] Identifying marker genes in transcription profiling data using a mixture of feature relevance experts
    Chow, ML
    Moler, EJ
    Mian, IS
    [J]. PHYSIOLOGICAL GENOMICS, 2001, 5 (02) : 99 - 111
  • [7] Identifying the structure in cuttlefish visual signals
    Crook, AC
    Baddeley, R
    Osorio, D
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 2002, 357 (1427) : 1617 - 1624
  • [8] Open source clustering software
    de Hoon, MJL
    Imoto, S
    Nolan, J
    Miyano, S
    [J]. BIOINFORMATICS, 2004, 20 (09) : 1453 - 1454
  • [9] Fuzzy C-means method for clustering microarray data
    Dembélé, D
    Kastner, P
    [J]. BIOINFORMATICS, 2003, 19 (08) : 973 - 980
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868