A new reliable cancer diagnosis method using boosted fuzzy classifier with a SWEEP operator method

被引:21
作者
Takahashi, H [1 ]
Honda, H [1 ]
机构
[1] Nagoya Univ, Sch Engn, Dept Biotechnol, Chikusa Ku, Nagoya, Aichi 4648603, Japan
关键词
cancer diagnosis; boosting; fuzzy classifier; reliability evaluation; rule extraction;
D O I
10.1252/jcej.38.763
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
For the adequate treatment of patients, it is important to have an accurate and reliable algorithm developed for construction of a diagnosis system that can deal with gene expression data of DNA microarray, or proteomic data obtained by means of mass spectrometry (MS). It is also necessary that this algorithm is fast because these data consist of thousands of attributes (genes or proteins). We have developed a boosted fuzzy classifier with a SWEEP operator (BFCS) method on the basis of the fuzzy theory and boosting algorithm. This method has been applied for the construction of class predictors for cancer diagnosis using clinical data for breast cancer or proteomic pattern data of MS for ovarian cancer. The model performance has been evaluated by comparison with a conventional method such as a support vector machine (SVM) and a fuzzy neural network combined with the SWEEP operator (FNN-SWEEP) method previously proposed by us. The BFCS algorithm is 1,000 to 10,000 times faster than the other two methods. The constructed BFCS class predictors could discriminate classes of breast cancer and ovarian cancer with the same or higher accuracy than the other two methods. Furthermore, BFCS enabled the calculation of the reliability index for each patient, while the feature is not incorporated into a conventional algorithm. Based on this index, the discriminated group with 100% prediction accuracy was separated from the others.
引用
收藏
页码:763 / 773
页数:11
相关论文
共 35 条
  • [1] Fuzzy neural network applied to gene expression profiling for predicting the prognosis of diffuse large B-cell lymphoma
    Ando, T
    Suguro, M
    Hanai, T
    Kobayashi, T
    Honda, H
    Seto, M
    [J]. JAPANESE JOURNAL OF CANCER RESEARCH, 2002, 93 (11): : 1207 - 1212
  • [2] [Anonymous], 1999, REPOSIT TU DORTMUND, DOI DOI 10.17877/DE290R-5098
  • [3] SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence
    Bhasin, M
    Raghava, GPS
    [J]. BIOINFORMATICS, 2004, 20 (03) : 421 - 423
  • [4] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [5] Protein classification based on text document classification techniques
    Cheng, BYM
    Carbonell, JG
    Klein-Seetharaman, J
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (04) : 955 - 970
  • [6] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [7] An adaptive version of the boost by majority algorithm
    Freund, Y
    [J]. MACHINE LEARNING, 2001, 43 (03) : 293 - 318
  • [8] A decision-theoretic generalization of on-line learning and an application to boosting
    Freund, Y
    Schapire, RE
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) : 119 - 139
  • [9] Additive logistic regression: A statistical view of boosting - Rejoinder
    Friedman, J
    Hastie, T
    Tibshirani, R
    [J]. ANNALS OF STATISTICS, 2000, 28 (02) : 400 - 407
  • [10] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537