The classification of cancer stage microarray data

被引:13
作者
Chen, Chi-Kan [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Appl Math, Taichung, Taiwan
关键词
Classification; Ordinal response; Cancer stage; Microarray data; MOLECULAR CLASSIFICATION; REGRESSION-MODELS; GENE SELECTION; PREDICTION;
D O I
10.1016/j.cmpb.2012.07.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Correctly diagnosing the cancer stage is most important for selecting an appropriate cancer treatment option for a patient. Recent advances in microarray technology allow the cancer stage to be predicted using gene expression patterns. The cancer stage is in ordinal scale. In this paper, we employ strict ordinal regressions including cumulative logit model in traditional statistics with data dimensionality reduction, and distribution free approaches of large margin rank boundaries implemented by the support vector machine, as well as an ensemble ranking scheme to model the cancer stage using gene expression microarray data. Predictive genes included in Models are selected by univariate feature ranking, and recursive feature elimination. We perform cross-validation experiments to assess and compare classification accuracies of ordinal and non-ordinal algorithms on five cancer stage microarray datasets. We conclude that a strict ordinal classifier trained by a validated approach can predict the cancer stage more accurately than traditional non-ordinal classifiers without considering the order of cancer stages. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:1070 / 1077
页数:8
相关论文
共 27 条
  • [1] Regression models for ordinal responses: A review of methods and applications
    Ananth, CV
    Kleinbaum, DG
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 1997, 26 (06) : 1323 - 1333
  • [2] [Anonymous], 2005, INT C MACH LEARN
  • [3] [Anonymous], LARGE MARGIN RANK BO
  • [4] [Anonymous], R LANG ENV STAT COMP
  • [5] L 1 penalized continuation ratio models for ordinal response prediction using high-dimensional datasets
    Archer, K. J.
    Williams, A. A. A.
    [J]. STATISTICS IN MEDICINE, 2012, 31 (14) : 1464 - 1474
  • [6] Chu S., 2005, P 22 INT C MACH LEAR, P145, DOI DOI 10.1145/1102351.1102370
  • [7] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [8] Identifying distinct classes of bladder carcinoma using microarrays
    Dyrskjot, L
    Thykjaer, T
    Kruhoffer, M
    Jensen, JL
    Marcussen, N
    Hamilton-Dutoit, S
    Wolf, H
    Orntoft, TF
    [J]. NATURE GENETICS, 2003, 33 (01) : 90 - 96
  • [9] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [10] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422