Quality assessment of Affymetrix GeneChip data using the EM algorithm and a naive Bayes classifier

Cited by: 0
Authors
Howard, Brian E. [1 ]
Sick, Beate [4 ]
Perera, Imara [2 ]
Im, Yang Ju [2 ]
Winter-Sederoff, Heike [2 ]
Heber, Steffen [3 ]
Affiliations
[1] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Dept Plant Biol, Raleigh, NC 27695 USA
[3] North Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695 USA
[4] Zurich Univ Appl Sci Winterthur, Inst Data Anal & Proc Design, Winterthur, Switzerland
Source
PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II | 2007
Keywords
microarray; quality control; EM algorithm; naive Bayes
DOI
Not available
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Recent research has demonstrated the utility of using supervised classification systems for automatic identification of low quality microarray data. However, this approach requires annotation of a large training set by a qualified expert. In this paper we demonstrate the utility of an unsupervised classification technique based on the Expectation-Maximization (EM) algorithm and naive Bayes classification. On our test set, this system exhibits performance comparable to that of an analogous supervised learner constructed from the same training data.
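The unsupervised idea described in the abstract can be illustrated with a minimal sketch: a naive Bayes model whose class labels are treated as hidden and estimated with EM. This is not the authors' implementation. The Gaussian per-feature likelihoods, the rank-based initialisation, the function names (`em_naive_bayes`, `hard_labels`, `make_toy_data`) and the synthetic "quality features" are all assumptions made for illustration; the abstract does not state which features or distributions the paper uses.

```python
import math
import random


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate normal distribution."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def em_naive_bayes(X, n_classes=2, n_iter=50, min_var=1e-6):
    """Fit a Gaussian naive Bayes model with hidden class labels by EM.

    Returns (priors, means, variances, responsibilities).
    """
    n, d = len(X), len(X[0])
    # Assumed initialisation: rank samples by their feature sum and
    # hard-assign equal-sized blocks of that ranking to the classes.
    order = sorted(range(n), key=lambda i: sum(X[i]))
    resp = [[0.0] * n_classes for _ in range(n)]
    for rank, i in enumerate(order):
        resp[i][rank * n_classes // n] = 1.0

    for _ in range(n_iter):
        # M-step: re-estimate priors and per-feature means and variances
        # from the current soft assignments (features treated as
        # conditionally independent given the class).
        priors, means, variances = [], [], []
        for k in range(n_classes):
            nk = sum(r[k] for r in resp) + 1e-12
            priors.append(nk / n)
            mu = [sum(resp[i][k] * X[i][j] for i in range(n)) / nk
                  for j in range(d)]
            var = [max(sum(resp[i][k] * (X[i][j] - mu[j]) ** 2
                           for i in range(n)) / nk, min_var)
                   for j in range(d)]
            means.append(mu)
            variances.append(var)

        # E-step: posterior class probabilities for every sample,
        # computed in log space to avoid underflow.
        for i in range(n):
            logp = [math.log(priors[k])
                    + sum(gaussian_logpdf(X[i][j], means[k][j], variances[k][j])
                          for j in range(d))
                    for k in range(n_classes)]
            top = max(logp)
            weights = [math.exp(lp - top) for lp in logp]
            total = sum(weights)
            resp[i] = [w / total for w in weights]

    return priors, means, variances, resp


def hard_labels(resp):
    """Assign each sample to its most probable class."""
    return [max(range(len(r)), key=lambda k: r[k]) for r in resp]


def make_toy_data(n_good=40, n_bad=10, d=3, seed=0):
    """Synthetic stand-in for per-chip quality features (hypothetical)."""
    rng = random.Random(seed)
    good = [[rng.gauss(0.0, 0.5) for _ in range(d)] for _ in range(n_good)]
    bad = [[rng.gauss(4.0, 0.5) for _ in range(d)] for _ in range(n_bad)]
    return good + bad


X = make_toy_data()
priors, means, variances, resp = em_naive_bayes(X)
labels = hard_labels(resp)
```

No expert annotation enters the fit: the class posteriors in `resp` play the role that hand-assigned quality labels would play in the supervised setting. Which learned class corresponds to "low quality" still has to be decided afterwards, for example by inspecting the class means.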
Pages: 145 / +
Page count: 3