Machine learning enables detection of early-stage colorectal cancer by whole-genome sequencing of plasma cell-free DNA

被引:122
作者
Wan, Nathan [1 ]
Weinberg, David [1 ]
Liu, Tzu-Yu [1 ]
Niehaus, Katherine [1 ]
Ariazi, Eric A. [1 ]
Delubac, Daniel [1 ]
Kannan, Ajay [1 ]
White, Brandon [1 ]
Bailey, Mitch [1 ]
Bertin, Marvin [1 ]
Boley, Nathan [1 ]
Bowen, Derek [1 ]
Cregg, James [1 ]
Drake, Adam M. [1 ]
Ennis, Riley [1 ]
Fransen, Signe [1 ]
Gafni, Erik [1 ]
Hansen, Loren [1 ]
Liu, Yaping [1 ]
Otte, Gabriel L. [1 ]
Pecson, Jennifer [1 ]
Rice, Brandon [1 ]
Sanderson, Gabriel E. [1 ]
Sharma, Aarushi [1 ]
St John, John [1 ]
Tang, Catherina [1 ]
Tzou, Abraham [1 ]
Young, Leilani [1 ]
Putcha, Girish [1 ]
Haque, Imran S. [1 ]
机构
[1] Freenome, San Francisco, CA 94080 USA
关键词
Cell-free DNA; Colorectal cancer; Screening; Whole-genome sequencing; Early-stage cancer; LIQUID BIOPSIES; TUMOR;
D O I
10.1186/s12885-019-6003-8
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Blood-based methods using cell-free DNA (cfDNA) are under development as an alternative to existing screening tests. However, early-stage detection of cancer using tumor-derived cfDNA has proven challenging because of the small proportion of cfDNA derived from tumor tissue in early-stage disease. A machine learning approach to discover signatures in cfDNA, potentially reflective of both tumor and non-tumor contributions, may represent a promising direction for the early detection of cancer. Methods Whole-genome sequencing was performed on cfDNA extracted from plasma samples (N = 546 colorectal cancer and 271 non-cancer controls). Reads aligning to protein-coding gene bodies were extracted, and read counts were normalized. cfDNA tumor fraction was estimated using IchorCNA. Machine learning models were trained using k-fold cross-validation and confounder-based cross-validations to assess generalization performance. Results In a colorectal cancer cohort heavily weighted towards early-stage cancer (80% stage I/II), we achieved a mean AUC of 0.92 (95% CI 0.91-0.93) with a mean sensitivity of 85% (95% CI 83-86%) at 85% specificity. Sensitivity generally increased with tumor stage and increasing tumor fraction. Stratification by age, sequencing batch, and institution demonstrated the impact of these confounders and provided a more accurate assessment of generalization performance. Conclusions A machine learning approach using cfDNA achieved high sensitivity and specificity in a large, predominantly early-stage, colorectal cancer cohort. The possibility of systematic technical and institution-specific biases warrants similar confounder analyses in other studies. Prospective validation of this machine learning method and evaluation of a multi-analyte approach are underway.
引用
收藏
页数:10
相关论文
共 37 条
[1]   Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors [J].
Adalsteinsson, Viktor A. ;
Ha, Gavin ;
Freeman, Samuel S. ;
Choudhury, Atish D. ;
Stover, Daniel G. ;
Parsons, Heather A. ;
Gydush, Gregory ;
Reed, Sarah C. ;
Rotem, Denisse ;
Rhoades, Justin ;
Loginov, Denis ;
Livitz, Dimitri ;
Rosebrock, Daniel ;
Leshchiner, Ignaty ;
Kim, Jaegil ;
Stewart, Chip ;
Rosenberg, Mara ;
Francis, Joshua M. ;
Zhang, Cheng-Zhong ;
Cohen, Ofir ;
Oh, Coyin ;
Ding, Huiming ;
Polak, Paz ;
Lloyd, Max ;
Mahmud, Sairah ;
Helvie, Karla ;
Merrill, Margaret S. ;
Santiago, Rebecca A. ;
O'Connor, Edward P. ;
Jeong, Seong H. ;
Leeson, Rachel ;
Barry, Rachel M. ;
Kramkowski, Joseph F. ;
Zhang, Zhenwei ;
Polacek, Laura ;
Lohr, Jens G. ;
Schleicher, Molly ;
Lipscomb, Emily ;
Saltzman, Andrea ;
Oliver, Nelly M. ;
Marini, Lori ;
Waks, Adrienne G. ;
Harshman, Lauren C. ;
Tolaney, Sara M. ;
Van Allen, Eliezer M. ;
Winer, Eric P. ;
Lin, Nancy U. ;
Nakabayashi, Mari ;
Taplin, Mary-Ellen ;
Johannessen, Cory M. .
NATURE COMMUNICATIONS, 2017, 8
[2]  
[Anonymous], CHALLENGES USING CTD
[3]   Next-Generation Sequencing of Circulating Tumor DNA for Early Cancer Detection [J].
Aravanis, Alexander M. ;
Lee, Mark ;
Klausner, Richard D. .
CELL, 2017, 168 (04) :571-574
[4]   RNA-Seq of Tumor-Educated Platelets Enables Blood-Based Pan-Cancer, Multiclass, and Molecular Pathway Cancer Diagnostics [J].
Best, Myron G. ;
Sol, Nik ;
Kooi, Irsan ;
Tannous, Jihane ;
Westerman, Bart A. ;
Rustenburg, Francois ;
Schellen, Pepijn ;
Verschueren, Heleen ;
Post, Edward ;
Koster, Jan ;
Ylstra, Bauke ;
Ameziane, Najim ;
Dorsman, Josephine ;
Smit, Egbert F. ;
Verheul, Henk M. ;
Noske, David P. ;
Reijneveld, Jaap C. ;
Nilsson, R. Jonas A. ;
Tannous, Bakhos A. ;
Wesseling, Pieter ;
Wurdinger, Thomas .
CANCER CELL, 2015, 28 (05) :666-676
[5]   Detection of Circulating Tumor DNA in Early- and Late-Stage Human Malignancies [J].
Bettegowda, Chetan ;
Sausen, Mark ;
Leary, Rebecca J. ;
Kinde, Isaac ;
Wang, Yuxuan ;
Agrawal, Nishant ;
Bartlett, Bjarne R. ;
Wang, Hao ;
Luber, Brandon ;
Alani, Rhoda M. ;
Antonarakis, Emmanuel S. ;
Azad, Nilofer S. ;
Bardelli, Alberto ;
Brem, Henry ;
Cameron, John L. ;
Lee, Clarence C. ;
Fecher, Leslie A. ;
Gallia, Gary L. ;
Gibbs, Peter ;
Le, Dung ;
Giuntoli, Robert L. ;
Goggins, Michael ;
Hogarty, Michael D. ;
Holdhoff, Matthias ;
Hong, Seung-Mo ;
Jiao, Yuchen ;
Juhl, Hartmut H. ;
Kim, Jenny J. ;
Siravegna, Giulia ;
Laheru, Daniel A. ;
Lauricella, Calogero ;
Lim, Michael ;
Lipson, Evan J. ;
Marie, Suely Kazue Nagahashi ;
Netto, George J. ;
Oliner, Kelly S. ;
Olivi, Alessandro ;
Olsson, Louise ;
Riggins, Gregory J. ;
Sartore-Bianchi, Andrea ;
Schmidt, Kerstin ;
Shih, Ie-Ming ;
Oba-Shinjo, Sueli Mieko ;
Siena, Salvatore ;
Theodorescu, Dan ;
Tie, Jeanne ;
Harkins, Timothy T. ;
Veronese, Silvio ;
Wang, Tian-Li ;
Weingart, Jon D. .
SCIENCE TRANSLATIONAL MEDICINE, 2014, 6 (224)
[6]   Noninvasive detection of cancer-associated genome-wide hypomethylation and copy number aberrations by plasma DNA bisulfite sequencing [J].
Chan, K. C. Allen ;
Jiang, Peiyong ;
Chan, Carol W. M. ;
Sun, Kun ;
Wong, John ;
Hui, Edwin P. ;
Chan, Stephen L. ;
Chan, Wing Cheong ;
Hui, David S. C. ;
Ng, Simon S. M. ;
Chan, Henry L. Y. ;
Wong, Cesar S. C. ;
Ma, Brigette B. Y. ;
Chan, Anthony T. C. ;
Lai, Paul B. S. ;
Sun, Hao ;
Chiu, Rossa W. K. ;
Lo, Y. M. Dennis .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (47) :18761-18768
[7]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[8]   Detection and localization of surgically resectable cancers with a multi-analyte blood test [J].
Cohen, Joshua D. ;
Li, Lu ;
Wang, Yuxuan ;
Thoburn, Christopher ;
Afsari, Bahman ;
Danilova, Ludmila ;
Douville, Christopher ;
Javed, Ammar A. ;
Wong, Fay ;
Mattox, Austin ;
Hruban, Ralph. H. ;
Wolfgang, Christopher L. ;
Goggins, Michael G. ;
Dal Molin, Marco ;
Wang, Tian-Li ;
Roden, Richard ;
Klein, Alison P. ;
Ptak, Janine ;
Dobbyn, Lisa ;
Schaefer, Joy ;
Silliman, Natalie ;
Popoli, Maria ;
Vogelstein, Joshua T. ;
Browne, James D. ;
Schoen, Robert E. ;
Brand, Randall E. ;
Tie, Jeanne ;
Gibbs, Peter ;
Wong, Hui-Li ;
Mansfield, Aaron S. ;
Jen, Jin ;
Hanash, Samir M. ;
Falconi, Massimo ;
Allen, Peter J. ;
Zhou, Shibin ;
Bettegowda, Chetan ;
Diaz, Luis A., Jr. ;
Tomasetti, Cristian ;
Kinzler, Kenneth W. ;
Vogelstein, Bert ;
Lennon, Anne Marie ;
Papadopoulos, Nickolas .
SCIENCE, 2018, 359 (6378) :926-+
[9]   The chromatin accessibility landscape of primary human cancers [J].
Corces, M. Ryan ;
Granja, Jeffrey M. ;
Shams, Shadi ;
Louie, Bryan H. ;
Seoane, Jose A. ;
Zhou, Wanding ;
Silva, Tiago C. ;
Groeneveld, Clarice ;
Wong, Christopher K. ;
Cho, Seung Woo ;
Satpathy, Ansuman T. ;
Mumbach, Maxwell R. ;
Hoadley, Katherine A. ;
Robertson, A. Gordon ;
Sheffield, Nathan C. ;
Felau, Ina ;
Castro, Mauro A. A. ;
Berman, Benjamin P. ;
Staudt, Louis M. ;
Zenklusen, Jean C. ;
Laird, Peter W. ;
Curtis, Christina ;
Greenleaf, William J. ;
Chang, Howard Y. .
SCIENCE, 2018, 362 (6413) :420-+
[10]   Multi-analyte profiling reveals relationships among circulating biomarkers in colorectal cancer [J].
Delubac, Daniel ;
Ariazi, Eric ;
Berliner, Jonathan ;
Drake, Adam ;
Dulin, John ;
Ennis, Riley ;
Gafni, Erik ;
Niehaus, Kate ;
Otte, Gabriel ;
Pecson, Jennifer ;
Putcha, Girish ;
Schaninger, Corey ;
Sharma, Aarushi ;
Singer, Mike ;
Tzou, Abraham ;
Waters, Jill ;
Weinberg, David ;
White, Brandon ;
Haque, Imran S. .
CANCER RESEARCH, 2018, 78 (13)