Transfer learning with convolutional neural networks for cancer survival prediction using gene-expression data

Cited by: 55
Authors
Lopez-Garcia, Guillermo [1]
Jerez, Jose M. [1]
Franco, Leonardo [1]
Veredas, Francisco J. [1]
Affiliations
[1] Univ Malaga, Dept Lenguajes & Ciencias Comp, ETSI Informat, Malaga, Spain
Source
PLOS ONE | 2020 / Volume 15 / Issue 03
Keywords
RNA-SEQ; DEEP; GENOME; RECURRENCE; SIGNATURE; KEGG;
DOI
10.1371/journal.pone.0230536
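Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes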
07 ; 0710 ; 09 ;
Abstract
Precision medicine in oncology aims to obtain data from heterogeneous sources in order to precisely estimate a given patient's state and prognosis. Within a personalized medicine framework, accurate diagnoses allow the prescription of more effective treatments adapted to the specificities of each individual case. In recent years, next-generation sequencing has driven cancer research by providing physicians with an overwhelming amount of gene-expression data from high-throughput RNA-seq platforms. In this scenario, data mining and machine learning techniques have contributed widely to gene-expression data analysis by supplying computational models to support decision-making on real-world data. Nevertheless, existing public gene-expression databases are characterized by an unfavorable imbalance between the huge number of genes (on the order of tens of thousands) and the small number of samples (on the order of a few hundred) available. Although diverse feature selection and extraction strategies have traditionally been applied to overcome the resulting over-fitting issues, the efficacy of standard machine learning pipelines is far from satisfactory for the prediction of relevant clinical outcomes such as follow-up endpoints or patient survival. Using the public Pan-Cancer dataset, in this study we pre-train convolutional neural network architectures for survival prediction on a subset composed of thousands of gene-expression samples from thirty-one tumor types. The resulting architectures are subsequently fine-tuned to predict lung cancer progression-free interval. The application of convolutional networks to gene-expression data has many limitations, derived from the unstructured nature of these data. In this work we propose a methodology to rearrange RNA-seq data by transforming RNA-seq samples into gene-expression images, from which convolutional networks can extract high-level features.
As an additional objective, we investigate whether leveraging the information extracted from other tumor-type samples contributes to the extraction of high-level features that improve lung cancer progression prediction, compared to other machine learning approaches.
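As a rough illustration of the idea of turning an RNA-seq sample into a gene-expression image, the sketch below lays a one-dimensional expression profile out on a square grid. It is only a minimal sketch: the abstract does not state how genes are positioned, so the function name `expression_to_image`, the optional `order` argument (a hypothetical gene layout), and the zero padding are all assumptions, not the authors' procedure.

```python
import math


def expression_to_image(values, order=None, pad_value=0.0):
    """Arrange a 1-D gene-expression profile into a square 2-D grid.

    `order` is a hypothetical list of gene indices defining the layout;
    if omitted, genes keep their original order (an assumption).
    """
    if order is None:
        order = range(len(values))
    arranged = [values[i] for i in order]
    # Side of the smallest square grid that can hold every gene.
    side = math.ceil(math.sqrt(len(arranged)))
    # Fill the unused cells so that all rows have the same length.
    arranged += [pad_value] * (side * side - len(arranged))
    return [arranged[r * side:(r + 1) * side] for r in range(side)]


# Toy profile with five genes, laid out on a 3 x 3 grid.
profile = [5.1, 0.0, 2.3, 7.8, 1.2]
image = expression_to_image(profile)
```

A grid of this kind can be fed to a convolutional network like a single-channel image; whether convolutions find useful local patterns depends on the chosen gene ordering, since neighboring pixels only carry meaning if neighboring genes are related.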
Pages: 24