An analytical method for multiclass molecular cancer classification

被引:54
|
作者
Rifkin, R [1 ]
Mukherjee, S
Tamayo, P
Ramaswamy, S
Yeang, CH
Angelo, M
Reich, M
Poggio, T
Lander, ES
Golub, TR
Mesirov, JP
机构
[1] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02139 USA
[2] Dana Farber Canc Inst, Dept Adult Oncol, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Pediat Oncol, Boston, MA 02115 USA
[4] MIT, Dept Biol, Cambridge, MA 02139 USA
[5] MIT, McGovern Inst, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
[6] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[7] X Mine, Brisbane, CA 94005 USA
关键词
multiclass classification; support vector machine; tumor; molecular classification; pattern recognition; cancer; computational biology;
D O I
10.1137/S0036144502411986
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Modern cancer treatment relies upon microscopic tissue examination to classify tumors according to anatomical site of origin. This approach is effective but subjective and variable even among experienced clinicians and pathologists. Recently, DNA microarray-generated gene expression data has been used to build molecular cancer classifiers. Previous work from our group and others demonstrated methods for solving pairwise classification problems using such global gene expression patterns. However, classification across multiple primary tumor classes poses new methodological and computational challenges. In this paper we describe a computational methodology for multiclass prediction that combines class-specific (one vs. all) binary support vector machines. We apply this methodology to the diagnosis of multiple common adult malignancies using DNA microarray data from a collection of 198 tumor samples, spanning 14 of the most common tumor types. Overall classification accuracy is 78%, far exceeding the expected accuracy for random classification. In a large subset of the samples (80%), the algorithm attains 90% accuracy. The methodology described in this paper both demonstrates that accurate gene expression-based multiclass cancer diagnosis is possible and highlights some of the analytic challenges inherent in applying such strategies to biomedical research.
引用
收藏
页码:706 / 723
页数:18
相关论文
共 50 条
  • [21] Multiclass Job Recommendation System in the IT Field between Classification and Prediction Method
    Prafajar, Kevin Novian
    Vallyan, Hansen
    Candradewi, Ni Luh Putu Adi
    Edbert, Ivan Sebastian
    Suhartono, Derwin
    2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 181 - 186
  • [22] Regularized Least Squares Twin SVM for Multiclass Classification
    Ali, Javed
    Aldhaifallah, M.
    Nisar, Kottakkaran Sooppy
    Aljabr, A. A.
    Tanveer, M.
    BIG DATA RESEARCH, 2022, 27
  • [23] On the consistency of multiclass classification methods
    Tewari, Ambuj
    Bartlett, Peter L.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 1007 - 1025
  • [24] Extreme Multiclass Classification Criteria
    Choromanska, Anna
    Jain, Ish Kumar
    COMPUTATION, 2019, 7 (01)
  • [25] RipMC: RIPPER for Multiclass Classification
    Asadi, Shahrokh
    Shahrabi, Jamal
    NEUROCOMPUTING, 2016, 191 : 19 - 33
  • [26] Blind Multiclass Ensemble Classification
    Traganitis, Panagiotis A.
    Pages-Zamora, Alba
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (18) : 4737 - 4752
  • [27] Training Data Optimization Strategy for Multiclass Text Classification
    Arusada, Muhammad Diaphan Nizam
    Putri, Novi Amalia Santika
    Alamsyah, Andry
    2017 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOIC7), 2017,
  • [28] Multiclass support vector classification via coding and regression
    Chen, Pei-Chun
    Lee, Kuang-Yao
    Lee, Tsung-Ju
    Lee, Yuh-Jye
    Huang, Su-Yun
    NEUROCOMPUTING, 2010, 73 (7-9) : 1501 - 1512
  • [29] Privacy Preserving Multiclass Classification for Horizontally Distributed Data
    Lu, Yunmei
    Yan, Mingyuan
    Han, Meng
    Yang, Qingliang
    Zhang, Yanqing
    SIGITE'18: PROCEEDINGS OF THE 19TH ANNUAL SIG CONFERENCE ON INFORMATION TECHNOLOGY EDUCATION, 2018, : 165 - 165
  • [30] A triplot for multiclass classification visualisation
    Gardner-Lubbe, Sugnet
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 94 : 20 - 32