Systematic Analysis of Challenge-Driven Improvements in Molecular Prognostic Models for Breast Cancer

Cited by: 97
Authors
Margolin, Adam A. [1]
Bilal, Erhan [2]
Huang, Erich [1,3,4]
Norman, Thea C. [1]
Ottestad, Lars [5]
Mecham, Brigham H. [1,6]
Sauerwine, Ben [7]
Kellen, Michael R. [1]
Mangravite, Lara M. [1]
Furia, Matthew D. [1,8]
Vollan, Hans Kristian Moen [5,9,10,11]
Rueda, Oscar M. [11]
Guinney, Justin [1]
Deflaux, Nicole A. [1]
Hoff, Bruce [1]
Schildwachter, Xavier [1]
Russnes, Hege G. [9,10,12]
Park, Daehoon [13]
Vang, Veronica O. [9,10]
Pirtle, Tyler [7]
Youseff, Lamia [7]
Citro, Craig [7]
Curtis, Christina [14]
Kristensen, Vessela N. [9,10,15]
Hellerstein, Joseph [7]
Friend, Stephen H. [1]
Stolovitzky, Gustavo [2]
Aparicio, Samuel [16,17,18]
Caldas, Carlos [11,19,20,21]
Borresen-Dale, Anne-Lise [9,10]
Affiliations
[1] Sage Bionetworks, Seattle, WA 98109 USA
[2] IBM Corp, Computat Biol Ctr, Funct Genom & Syst Biol, Yorktown Hts, NY 10598 USA
[3] Duke Univ, Inst Genome Sci & Policy, Durham, NC 27708 USA
[4] Duke Univ, Sch Med, Dept Surg, Durham, NC 27710 USA
[5] Oslo Univ Hosp, Dept Oncol, Div Canc Surg & Transplantat, N-0450 Oslo, Norway
[6] Trialomics LLC, Seattle, WA 98103 USA
[7] Google Inc, Seattle, WA 98103 USA
[8] Novartis Res Fdn, Genom Inst, San Diego, CA 92121 USA
[9] Norwegian Radium Hosp, Oslo Univ Hosp, Inst Canc Res, Dept Genet, N-0310 Oslo, Norway
[10] Univ Oslo, Inst Clin Med, Fac Med, KG Jebsen Ctr Breast Canc Res, N-0313 Oslo, Norway
[11] Canc Res UK, Li Ka Shing Ctr, Cambridge Res Inst, Cambridge CB2 0RE, England
[12] Oslo Univ Hosp, Dept Pathol, N-0450 Oslo, Norway
[13] Drammen Hosp, Dept Pathol, Vestre Viken HF, N-3004 Drammen, Norway
[14] Univ So Calif, Keck Sch Med, Dept Prevent Med, Los Angeles, CA 90033 USA
[15] Akershus Univ Hosp, Div Med, Dept Clin Mol Oncol, N-1478 Ahus, Norway
[16] British Columbia Canc Res Ctr, Vancouver, BC V5Z 1L3, Canada
[17] Univ British Columbia, Dept Pathol & Lab Med, Vancouver, BC V6T 1Z4, Canada
[18] BC Canc Agcy, Genome Sci Ctr, Vancouver, BC V5Z 1L3, Canada
[19] Cambridge Univ Hosp NHS Fdn Trust, Addenbrookes Hosp, Cambridge Breast Unit, Cambridge CB2 2QQ, England
[20] NIHR Cambridge Biomed Res Ctr, Cambridge CB2 2QQ, England
[21] Cambridge Expt Canc Med Ctr, Cambridge CB2 0RE, England
Keywords
GENE NETWORK INFERENCE; MINDACT TRIAL; VALIDATION; PREDICTION; SURVIVAL; BIOLOGY; TUMORS;
DOI
10.1126/scitranslmed.3006112
Chinese Library Classification (CLC) number
Q2 [Cell Biology]
Discipline classification code
071009; 090102
Abstract
Although molecular prognostics in breast cancer are among the most successful examples of translating genomic analysis to clinical applications, optimal approaches to breast cancer clinical risk prediction remain controversial. The Sage Bionetworks-DREAM Breast Cancer Prognosis Challenge (BCC) is a crowdsourced research study for breast cancer prognostic modeling using genome-scale data. The BCC provided a community of data analysts with a common platform for data access and blinded evaluation of model accuracy in predicting breast cancer survival on the basis of gene expression data, copy number data, and clinical covariates. This approach offered the opportunity to assess whether a crowdsourced community Challenge would generate models of breast cancer prognosis commensurate with or exceeding current best-in-class approaches. The BCC comprised multiple rounds of blinded evaluations on held-out portions of data on 1981 patients, resulting in more than 1400 models submitted as open source code. Participants then retrained their models on the full data set of 1981 samples and submitted up to five models for validation in a newly generated data set of 184 breast cancer patients. Analysis of the BCC results suggests that the best-performing modeling strategy outperformed previously reported methods in blinded evaluations; model performance was consistent across several independent evaluations; and aggregating community-developed models achieved performance on par with the best-performing individual models.
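The abstract reports that aggregating community-developed models performed on par with the best individual models, but it does not say how predictions were scored or combined. The sketch below is one illustrative way to do both, under stated assumptions: it scores risk predictions with a simplified concordance index and combines models by averaging their per-patient ranks. The metric, the aggregation rule, the function names, and the toy data are all hypothetical choices for illustration, not taken from the record.

```python
from itertools import combinations


def concordance_index(times, events, scores):
    """Fraction of comparable patient pairs ordered correctly by risk score.

    Assumption: a higher score means higher predicted risk (shorter survival).
    A pair is comparable when the patient with the shorter follow-up time
    had an observed event (events[k] is truthy); tied scores count as half.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue  # pairs with tied times are skipped in this simplified sketch
        first, second = (i, j) if times[i] < times[j] else (j, i)
        if not events[first]:
            continue  # earlier patient was censored, so the pair is not comparable
        comparable += 1
        if scores[first] > scores[second]:
            concordant += 1.0
        elif scores[first] == scores[second]:
            concordant += 0.5
    return concordant / comparable


def to_ranks(scores):
    """Convert scores to 1-based ranks; equal scores are ordered by position."""
    order = sorted(range(len(scores)), key=lambda k: scores[k])
    ranks = [0] * len(scores)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def aggregate_by_mean_rank(model_scores):
    """Combine several models by averaging each patient's rank across models."""
    ranked = [to_ranks(s) for s in model_scores]
    n_patients = len(ranked[0])
    return [sum(r[k] for r in ranked) / len(ranked) for k in range(n_patients)]


# Hypothetical toy data: follow-up times, event indicators (1 = death observed,
# 0 = censored), and risk scores from two made-up models.
times = [2, 4, 6, 8, 10]
events = [1, 1, 0, 1, 0]
model_a = [0.9, 0.7, 0.4, 0.5, 0.1]
model_b = [0.6, 0.8, 0.5, 0.3, 0.2]

ensemble = aggregate_by_mean_rank([model_a, model_b])
for name, scores in [("model_a", model_a), ("model_b", model_b), ("ensemble", ensemble)]:
    print(name, concordance_index(times, events, scores))
```

Rank averaging is used here because it needs no retraining and is insensitive to the scale of each model's scores. That makes it a convenient baseline when the submitted models emit risk scores on different scales.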
Pages: 11