Machine learning with in silico analysis markedly improves survival prediction modeling in colon cancer patients

被引:12
作者
Lee, Choong-Jae [1 ]
Baek, Bin [2 ]
Cho, Sang Hee [3 ]
Jang, Tae-Young [1 ]
Jeon, So-El [1 ]
Lee, Sunjae [1 ]
Lee, Hyunju [2 ]
Nam, Jeong-Seok [1 ,4 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Life Sci, Gwangju 61005, South Korea
[2] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
[3] Chonnam Natl Univ, Med Sch, Dept Hematooncol, Gwangju, South Korea
[4] Gwangju Inst Sci & Technol, Cell Logist Res Ctr, Gwangju, South Korea
来源
CANCER MEDICINE | 2023年 / 12卷 / 06期
基金
新加坡国家研究基金会;
关键词
biomarkers; clinical outcome; colon cancer; in silico system analysis; machine learning; survival prediction model; GENE-EXPRESSION; PROGNOSIS; METASTASIS; SELECTION;
D O I
10.1002/cam4.5420
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Predicting the survival of cancer patients provides prognostic information and therapeutic guidance. However, improved prediction models are needed for use in diagnosis and treatment. Objective This study aimed to identify genomic prognostic biomarkers related to colon cancer (CC) based on computational data and to develop survival prediction models. Methods We performed machine-learning (ML) analysis to screen pathogenic survival-related driver genes related to patient prognosis by integrating copy number variation and gene expression data. Moreover, in silico system analysis was performed to clinically assess data from ML analysis, and we identified RABGAP1L, MYH9, and DRD4 as candidate genes. These three genes and tumor stages were used to generate survival prediction models. Moreover, the genes were validated by experimental and clinical analyses, and the theranostic application of the survival prediction models was assessed. Results RABGAP1L, MYH9, and DRD4 were identified as survival-related candidate genes by ML and in silico system analysis. The survival prediction model using the expression of the three genes showed higher predictive performance when applied to predict the prognosis of CC patients. A series of functional analyses revealed that each knockdown of three genes reduced the protumor activity of CC cells. In particular, validation with an independent cohort of CC patients confirmed that the coexpression of MYH9 and DRD4 gene expression reflected poorer clinical outcomes in terms of overall survival and disease-free survival. Conclusions Our survival prediction approach will contribute to providing information on patients and developing a therapeutic strategy for CC patients.
引用
收藏
页码:7603 / 7615
页数:13
相关论文
共 45 条
  • [1] Applications of Bayesian network models in predicting types of hematological malignancies
    Agrahari, Rupesh
    Foroushani, Amir
    Docking, T. Roderick
    Chang, Linda
    Duns, Gerben
    Hudoba, Monika
    Karsan, Aly
    Zare, Habil
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [2] An Integrated Approach to Uncover Drivers of Cancer
    Akavia, Uri David
    Litvin, Oren
    Kim, Jessica
    Sanchez-Garcia, Felix
    Kotliar, Dylan
    Causton, Helen C.
    Pochanard, Panisa
    Mozes, Eyal
    Garraway, Levi A.
    Pe'er, Dana
    [J]. CELL, 2010, 143 (06) : 1005 - 1017
  • [3] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [4] DEOD: uncovering dominant effects of cancer-driver genes based on a partial covariance selection method
    Amgalan, Bayarbaatar
    Lee, Hyunju
    [J]. BIOINFORMATICS, 2015, 31 (15) : 2452 - 2460
  • [5] Predicting stage-specific cancer related genes and their dynamic modules by integrating multiple datasets
    Aouiche, Chaima
    Chen, Bolin
    Shang, Xuequn
    [J]. BMC BIOINFORMATICS, 2019, 20 (Suppl 7)
  • [6] Baek B., 2020, SCI REP-UK, V10, P1
  • [7] Correlation of amplification and overexpression of the c-myc oncogene in high-grade breast cancer:: FISH, in situ hybridisation and immunohistochemical analyses
    Blancato, J
    Singh, B
    Liu, A
    Liao, DJ
    Dickson, RB
    [J]. BRITISH JOURNAL OF CANCER, 2004, 90 (08) : 1612 - 1619
  • [8] Feature selection in machine learning: A new perspective
    Cai, Jie
    Luo, Jiawei
    Wang, Shulin
    Yang, Sheng
    [J]. NEUROCOMPUTING, 2018, 300 : 70 - 79
  • [9] Spelling interface using intracortical signals in a completely locked-in patient enabled via auditory neurofeedback training
    Chaudhary, Ujwal
    Vlachos, Ioannis
    Zimmermann, Jonas B.
    Espinosa, Arnau
    Tonin, Alessandro
    Jaramillo-Gonzalez, Andres
    Khalili-Ardali, Majid
    Topka, Helge
    Lehmberg, Jens
    Friehs, Gerhard M.
    Woodtli, Alain
    Donoghue, John P.
    Birbaumer, Niels
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [10] Data Descriptor: Meta-mining of copy number profiles of high-risk neuroblastoma tumors
    Depuydt, Pauline
    Koster, Jan
    Boeva, Valentina
    Hocking, Toby D.
    Speleman, Frank
    Schleiermacher, Gudrun
    De Preter, Katleen
    [J]. SCIENTIFIC DATA, 2018, 5