Machine learning application in personalised lung cancer recurrence and survivability prediction

被引:46
作者
Yang, Yang [1 ]
Xu, Li [2 ]
Sun, Liangdong [2 ]
Zhang, Peng [2 ]
Farid, Suzanne S. [1 ]
机构
[1] UCL, Dept Biochem Engn, Gower St, London WC1E 6BT, England
[2] Tongji Univ, Shanghai Pulm Hosp, Dept Thorac Surg, Sch Med, Shanghai 200043, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Machine learning; Decision tree; Lung cancer; Personalized diagnosis and prognosis; ARTIFICIAL NEURAL-NETWORKS; C-REACTIVE PROTEIN; GENETIC POLYMORPHISMS; 8TH EDITION; RISK; CLASSIFICATION; MUTATIONS; PROGNOSIS; MODEL; TP53;
D O I
10.1016/j.csbj.2022.03.035
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning is an important artificial intelligence technique that is widely applied in cancer diagnosis and detection. More recently, with the rise of personalised and precision medicine, there is a growing trend towards machine learning applications for prognosis prediction. However, to date, building reliable prediction models of cancer outcomes in everyday clinical practice is still a hurdle. In this work, we integrate genomic, clinical and demographic data of lung adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) patients from The Cancer Genome Atlas (TCGA) and introduce copy number variation (CNV) and mutation information of 15 selected genes to generate predictive models for recurrence and survivability. We compare the accuracy and benefits of three well-established machine learning algorithms: decision tree methods, neural networks and support vector machines. Although the accuracy of predictive models using the decision tree method has no significant advantage, the tree models reveal the most important predictors among genomic information (e.g. KRAS, EGFR, TP53), clinical status (e.g. TNM stage and radiotherapy) and demographics (e.g. age and gender) and how they influence the prediction of recurrence and survivability for both early stage LUAD and LUSC. The machine learning models have the potential to help clinicians to make personalised decisions on aspects such as follow-up timeline and to assist with personalised planning of future social care needs. (c) 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons. org/licenses/by/4.0/).
引用
收藏
页码:1811 / 1820
页数:10
相关论文
共 75 条
[1]   Follow up and surveillance of the patient with lung cancer: What do you do after surgery? [J].
Alberts, W. Michael .
RESPIROLOGY, 2007, 12 (01) :16-21
[2]   Preresection serum C-reactive protein measurement and survival among patients with resectable non-small cell lung cancer [J].
Alifano, Marco ;
Falcoz, Pierre E. ;
Seegers, Valerie ;
Roche, Nicolas ;
Schussler, Olivier ;
Younes, Mohamad ;
Antonacci, Filippo ;
Forgez, Patricia ;
Dechartres, Agnes ;
Massard, Gilbert ;
Damotte, Diane ;
Regnard, Jean-Francois .
JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2011, 142 (05) :1161-1167
[3]   Radiomics with artificial intelligence for precision medicine in radiation therapy [J].
Arimura, Hidetaka ;
Soufi, Mazen ;
Kamezawa, Hidemi ;
Ninomiya, Kenta ;
Yamada, Masahiro .
JOURNAL OF RADIATION RESEARCH, 2019, 60 (01) :150-157
[4]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[5]   A learning rule for very simple universal approximators consisting of a single layer of perceptrons [J].
Auer, Peter ;
Burgsteiner, Harald ;
Maass, Wolfgang .
NEURAL NETWORKS, 2008, 21 (05) :786-795
[6]   Variations in lung cancer risk among smokers [J].
Bach, PB ;
Kattan, MW ;
Thornquist, MD ;
Kris, MG ;
Tate, RC ;
Barnett, MJ ;
Hsieh, LJ ;
Begg, CB .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (06) :470-478
[7]  
Bray F, 2018, CA-CANCER J CLIN, V68, P394, DOI [10.3322/caac.21492, 10.3322/caac.21609]
[8]   Genetic polymorphisms in DNA repair genes and risk of lung cancer [J].
Butkiewicz, D ;
Rusin, M ;
Enewold, L ;
Shields, PG ;
Chorazy, M ;
Harris, CC .
CARCINOGENESIS, 2001, 22 (04) :593-597
[9]   SNHG7 mediates cisplatin-resistance in non-small cell lung cancer by activating PI3K/AKT pathway [J].
Chen, K. ;
Abuduwufuer, A. ;
Zhang, H. ;
Luo, L. ;
Suotesiyali, M. ;
Zou, Y. .
EUROPEAN REVIEW FOR MEDICAL AND PHARMACOLOGICAL SCIENCES, 2019, 23 (16) :6935-6943
[10]   Risk classification of cancer survival using ANN with gene expression data from multiple laboratories [J].
Chen, Yen-Chen ;
Ke, Wan-Chi ;
Chiu, Hung-Wen .
COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 48 :1-7