Reliability analysis of microarray data using fuzzy c-means and normal mixture modeling based classification methods

被引:31
作者
Asyali, MH
Alci, M
机构
[1] King Faisal Specialist Hosp & Res Ctr, Dept Biostat Epidemiol & Sci Comp, Riyadh 11211, Saudi Arabia
[2] Ege Univ, Dept Elect & Elect Engn, TR-35100 Izmir, Turkey
关键词
D O I
10.1093/bioinformatics/bti036
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A serious limitation in microarray analysis is the unreliability of the data generated from low signal intensities. Such data may produce erroneous gene expression ratios and cause unnecessary validation or post-analysis follow-up tasks. Therefore, the elimination of unreliable signal intensities will enhance reproducibility and reliability of gene expression ratios produced from microarray data. In this study, we applied fuzzy c-means (FCM) and normal mixture modeling (NMM) based classification methods to separate microarray data into reliable and unreliable signal intensity populations. Results: We compared the results of FCM classification with those of classification based on NMM. Both approaches were validated against reference sets of biological data consisting of only true positives and true negatives. We observed that both methods performed equally well in terms of sensitivity and specificity. Although a comparison of the computation times indicated that the fuzzy approach is computationally more efficient, other considerations support the use of NMM for the reliability analysis of microarray data.
引用
收藏
页码:644 / 649
页数:6
相关论文
共 33 条
  • [1] [Anonymous], NONLINEAR BIOMEDICAL
  • [2] [Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
  • [3] Assessment of reliability of microarray data and estimation of signal thresholds using mixture modeling
    Asyali, MH
    Shoukri, MM
    Demirkaya, O
    Khabar, KSA
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (08) : 2323 - 2335
  • [4] CONVERGENCE THEORY FOR FUZZY C-MEANS - COUNTEREXAMPLES AND REPAIRS
    BEZDEK, JC
    HATHAWAY, RJ
    SABIN, MJ
    TUCKER, WT
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1987, 17 (05): : 873 - 877
  • [5] Defining signal thresholds in DNA microarrays: exemplary application for invasive cancer
    Bilban, M
    Buehler, LK
    Head, S
    Desoye, G
    Quaranta, V
    [J]. BMC GENOMICS, 2002, 3 (1)
  • [6] Significance and statistical errors in the analysis of DNA microarray data
    Brody, JP
    Williams, BA
    Wold, BJ
    Quake, SR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (20) : 12975 - 12978
  • [7] Gene expression signature of fibroblast serum response predicts human cancer progression: Similarities between tumors and wounds
    Chang, HY
    Sneddon, JB
    Alizadeh, AA
    Sood, R
    West, RB
    Montgomery, K
    Chi, JT
    van de Rijn, M
    Botstein, D
    Brown, PO
    [J]. PLOS BIOLOGY, 2004, 2 (02) : 206 - 214
  • [8] Fuzzy C-means method for clustering microarray data
    Dembélé, D
    Kastner, P
    [J]. BIOINFORMATICS, 2003, 19 (08) : 973 - 980
  • [9] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [10] Duda R. O., 2000, PATTERN CLASSIFICATI