An analytical method for multiclass molecular cancer classification

被引:54
|
作者
Rifkin, R [1 ]
Mukherjee, S
Tamayo, P
Ramaswamy, S
Yeang, CH
Angelo, M
Reich, M
Poggio, T
Lander, ES
Golub, TR
Mesirov, JP
机构
[1] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02139 USA
[2] Dana Farber Canc Inst, Dept Adult Oncol, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Pediat Oncol, Boston, MA 02115 USA
[4] MIT, Dept Biol, Cambridge, MA 02139 USA
[5] MIT, McGovern Inst, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
[6] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[7] X Mine, Brisbane, CA 94005 USA
关键词
multiclass classification; support vector machine; tumor; molecular classification; pattern recognition; cancer; computational biology;
D O I
10.1137/S0036144502411986
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Modern cancer treatment relies upon microscopic tissue examination to classify tumors according to anatomical site of origin. This approach is effective but subjective and variable even among experienced clinicians and pathologists. Recently, DNA microarray-generated gene expression data has been used to build molecular cancer classifiers. Previous work from our group and others demonstrated methods for solving pairwise classification problems using such global gene expression patterns. However, classification across multiple primary tumor classes poses new methodological and computational challenges. In this paper we describe a computational methodology for multiclass prediction that combines class-specific (one vs. all) binary support vector machines. We apply this methodology to the diagnosis of multiple common adult malignancies using DNA microarray data from a collection of 198 tumor samples, spanning 14 of the most common tumor types. Overall classification accuracy is 78%, far exceeding the expected accuracy for random classification. In a large subset of the samples (80%), the algorithm attains 90% accuracy. The methodology described in this paper both demonstrates that accurate gene expression-based multiclass cancer diagnosis is possible and highlights some of the analytic challenges inherent in applying such strategies to biomedical research.
引用
收藏
页码:706 / 723
页数:18
相关论文
共 50 条
  • [31] A Multiclass Classification Tool Using Cloud Computing Architecture
    Shen, Chia-Ping
    Liu, Chia-Hung
    Lin, Feng-Sheng
    Lin, Han
    Huang, Chi-Ying F.
    Kao, Cheng-Yan
    Lai, Feipei
    Lin, Jeng-Wei
    2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 765 - 770
  • [32] Adaptive binary tree for fast SVM multiclass classification
    Chen, Jin
    Wang, Cheng
    Wang, Runsheng
    NEUROCOMPUTING, 2009, 72 (13-15) : 3370 - 3375
  • [33] Why Is Multiclass Classification Hard?
    Del Moral, Pablo
    Nowaczyk, Slawomir
    Pashami, Sepideh
    IEEE ACCESS, 2022, 10 : 80448 - 80462
  • [34] A prototype classification method and its use in a hybrid solution for multiclass pattern recognition
    Chou, CH
    Lin, CC
    Liu, YH
    Chang, F
    PATTERN RECOGNITION, 2006, 39 (04) : 624 - 634
  • [35] A new optimizing parameter approach of LSSVM multiclass classification model
    Yang, Kui He
    Zhao, Ling Ling
    NEURAL COMPUTING & APPLICATIONS, 2012, 21 (05) : 945 - 955
  • [36] Twin SVM for conditional probability estimation in binary and multiclass classification
    Shao, Yuan -Hai
    Lv, Xiao-Jing
    Huang, Ling-Wei
    Bai, Lan
    PATTERN RECOGNITION, 2023, 136
  • [37] Multidimensional genetic programming for multiclass classification
    La Cava, William
    Silva, Sara
    Danai, Kourosh
    Spector, Lee
    Vanneschi, Leonardo
    Moore, Jason H.
    SWARM AND EVOLUTIONARY COMPUTATION, 2019, 44 : 260 - 272
  • [38] Class binarization to neuroevolution for multiclass classification
    Lan, Gongjin
    Gao, Zhenyu
    Tong, Lingyao
    Liu, Ting
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22) : 19845 - 19862
  • [39] Least squares DAGSVM for multiclass classification
    Wu, Haoyu
    Zhou, Zhijian
    Journal of Information and Computational Science, 2015, 12 (18): : 6863 - 6871
  • [40] Class binarization to neuroevolution for multiclass classification
    Gongjin Lan
    Zhenyu Gao
    Lingyao Tong
    Ting Liu
    Neural Computing and Applications, 2022, 34 : 19845 - 19862