In silico learning of tumor evolution through mutational time series

被引:17
作者
Auslander, Noam [1 ]
Wolf, Yuri I. [1 ]
Koonin, Eugene V. [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
cancer progression; driver mutations; passenger mutations; machine learning; neural networks; PREMALIGNANT LESIONS; PASSENGER MUTATIONS; LUNG ADENOCARCINOMA; GENETIC ALTERATIONS; SOMATIC MUTATIONS; MULTISTEP NATURE; CANCER; CARCINOGENESIS; EXPRESSION; NETWORKS;
D O I
10.1073/pnas.1901695116
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer arises through the accumulation of somatic mutations over time. Understanding the sequence of mutation occurrence during cancer progression can assist early and accurate diagnosis and improve clinical decision-making. Here we employ long short-term memory (LSTM) networks, a class of recurrent neural network, to learn the evolution of a tumor through an ordered sequence of mutations. We demonstrate the capacity of LSTMs to learn complex dynamics of the mutational time series governing tumor progression, allowing accurate prediction of the mutational burden and the occurrence of mutations in the sequence. Using the probabilities learned by the LSTM, we simulate mutational data and show that the simulation results are statistically indistinguishable from the empirical data. We identify passenger mutations that are significantly associated with established cancer drivers in the sequence and demonstrate that the genes carrying these mutations are substantially enriched in interactions with the corresponding driver genes. Breaking the network into modules consisting of driver genes and their interactors, we show that these interactions are associated with poor patient prognosis, thus likely conferring growth advantage for tumor progression. Thus, application of LSTM provides for prediction of numerous additional conditional drivers and reveals hitherto unknown aspects of cancer evolution.
引用
收藏
页码:9501 / 9510
页数:10
相关论文
共 70 条
[51]  
SHIN DM, 1994, CANCER RES, V54, P321
[52]   The cancer genome [J].
Stratton, Michael R. ;
Campbell, Peter J. ;
Futreal, P. Andrew .
NATURE, 2009, 458 (7239) :719-724
[53]   Nesprin-1 role in DNA damage response [J].
Sur, Ilknur ;
Neumann, Sascha ;
Noegel, Angelika A. .
NUCLEUS-AUSTIN, 2014, 5 (02) :173-191
[54]  
Sutskever I., 2011, Proceedings of the 28th International Conference on Machine Learning (ICML-11)
[55]   The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible [J].
Szklarczyk, Damian ;
Morris, John H. ;
Cook, Helen ;
Kuhn, Michael ;
Wyder, Stefan ;
Simonovic, Milan ;
Santos, Alberto ;
Doncheva, Nadezhda T. ;
Roth, Alexander ;
Bork, Peer ;
Jensen, Lars J. ;
von Mering, Christian .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D362-D368
[56]   STRING v10: protein-protein interaction networks, integrated over the tree of life [J].
Szklarczyk, Damian ;
Franceschini, Andrea ;
Wyder, Stefan ;
Forslund, Kristoffer ;
Heller, Davide ;
Huerta-Cepas, Jaime ;
Simonovic, Milan ;
Roth, Alexander ;
Santos, Alberto ;
Tsafou, Kalliopi P. ;
Kuhn, Michael ;
Bork, Peer ;
Jensen, Lars J. ;
von Mering, Christian .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D447-D452
[57]  
Tanaka Takuji, 2009, J Carcinog, V8, P5
[58]   COSMIC: the Catalogue Of Somatic Mutations In Cancer [J].
Tate, John G. ;
Bamford, Sally ;
Jubb, Harry C. ;
Sondka, Zbyslaw ;
Beare, David M. ;
Bindal, Nidhi ;
Boutselakis, Harry ;
Cole, Charlotte G. ;
Creatore, Celestino ;
Dawson, Elisabeth ;
Fish, Peter ;
Harsha, Bhavana ;
Hathaway, Charlie ;
Jupe, Steve C. ;
Kok, Chai Yin ;
Noble, Kate ;
Ponting, Laura ;
Ramshaw, Christopher C. ;
Rye, Claire E. ;
Speedy, Helen E. ;
Stefancsik, Ray ;
Thompson, Sam L. ;
Wang, Shicai ;
Ward, Sari ;
Campbell, Peter J. ;
Forbes, Simon A. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D941-D947
[59]   Multiple numerical chromosome aberrations in cancer: what are their causes and what are their consequences? [J].
Teixeira, MR ;
Heim, S .
SEMINARS IN CANCER BIOLOGY, 2005, 15 (01) :3-12
[60]  
Tessneer Kandice L, 2013, J Can Res Updates, V2, P144