Machine learning with in silico analysis markedly improves survival prediction modeling in colon cancer patients

被引:13
作者
Lee, Choong-Jae [1 ]
Baek, Bin [2 ]
Cho, Sang Hee [3 ]
Jang, Tae-Young [1 ]
Jeon, So-El [1 ]
Lee, Sunjae [1 ]
Lee, Hyunju [2 ]
Nam, Jeong-Seok [1 ,4 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Life Sci, Gwangju 61005, South Korea
[2] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
[3] Chonnam Natl Univ, Med Sch, Dept Hematooncol, Gwangju, South Korea
[4] Gwangju Inst Sci & Technol, Cell Logist Res Ctr, Gwangju, South Korea
基金
新加坡国家研究基金会;
关键词
biomarkers; clinical outcome; colon cancer; in silico system analysis; machine learning; survival prediction model; GENE-EXPRESSION; PROGNOSIS; METASTASIS; SELECTION;
D O I
10.1002/cam4.5420
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Predicting the survival of cancer patients provides prognostic information and therapeutic guidance. However, improved prediction models are needed for use in diagnosis and treatment. Objective This study aimed to identify genomic prognostic biomarkers related to colon cancer (CC) based on computational data and to develop survival prediction models. Methods We performed machine-learning (ML) analysis to screen pathogenic survival-related driver genes related to patient prognosis by integrating copy number variation and gene expression data. Moreover, in silico system analysis was performed to clinically assess data from ML analysis, and we identified RABGAP1L, MYH9, and DRD4 as candidate genes. These three genes and tumor stages were used to generate survival prediction models. Moreover, the genes were validated by experimental and clinical analyses, and the theranostic application of the survival prediction models was assessed. Results RABGAP1L, MYH9, and DRD4 were identified as survival-related candidate genes by ML and in silico system analysis. The survival prediction model using the expression of the three genes showed higher predictive performance when applied to predict the prognosis of CC patients. A series of functional analyses revealed that each knockdown of three genes reduced the protumor activity of CC cells. In particular, validation with an independent cohort of CC patients confirmed that the coexpression of MYH9 and DRD4 gene expression reflected poorer clinical outcomes in terms of overall survival and disease-free survival. Conclusions Our survival prediction approach will contribute to providing information on patients and developing a therapeutic strategy for CC patients.
引用
收藏
页码:7603 / 7615
页数:13
相关论文
共 45 条
[1]   Applications of Bayesian network models in predicting types of hematological malignancies [J].
Agrahari, Rupesh ;
Foroushani, Amir ;
Docking, T. Roderick ;
Chang, Linda ;
Duns, Gerben ;
Hudoba, Monika ;
Karsan, Aly ;
Zare, Habil .
SCIENTIFIC REPORTS, 2018, 8
[2]   An Integrated Approach to Uncover Drivers of Cancer [J].
Akavia, Uri David ;
Litvin, Oren ;
Kim, Jessica ;
Sanchez-Garcia, Felix ;
Kotliar, Dylan ;
Causton, Helen C. ;
Pochanard, Panisa ;
Mozes, Eyal ;
Garraway, Levi A. ;
Pe'er, Dana .
CELL, 2010, 143 (06) :1005-1017
[3]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[4]   DEOD: uncovering dominant effects of cancer-driver genes based on a partial covariance selection method [J].
Amgalan, Bayarbaatar ;
Lee, Hyunju .
BIOINFORMATICS, 2015, 31 (15) :2452-2460
[5]   Predicting stage-specific cancer related genes and their dynamic modules by integrating multiple datasets [J].
Aouiche, Chaima ;
Chen, Bolin ;
Shang, Xuequn .
BMC BIOINFORMATICS, 2019, 20 (Suppl 7)
[6]  
Baek B., 2020, SCI REP-UK, V10, P1
[7]   Correlation of amplification and overexpression of the c-myc oncogene in high-grade breast cancer:: FISH, in situ hybridisation and immunohistochemical analyses [J].
Blancato, J ;
Singh, B ;
Liu, A ;
Liao, DJ ;
Dickson, RB .
BRITISH JOURNAL OF CANCER, 2004, 90 (08) :1612-1619
[8]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[9]  
Cancer Genome Atlas Network, 2012, Nature, V487, P330, DOI [10.1038/nature11252, DOI 10.1038/NATURE11252]
[10]   Spelling interface using intracortical signals in a completely locked-in patient enabled via auditory neurofeedback training [J].
Chaudhary, Ujwal ;
Vlachos, Ioannis ;
Zimmermann, Jonas B. ;
Espinosa, Arnau ;
Tonin, Alessandro ;
Jaramillo-Gonzalez, Andres ;
Khalili-Ardali, Majid ;
Topka, Helge ;
Lehmberg, Jens ;
Friehs, Gerhard M. ;
Woodtli, Alain ;
Donoghue, John P. ;
Birbaumer, Niels .
NATURE COMMUNICATIONS, 2022, 13 (01)