Identifying Stage II Colorectal Cancer Recurrence Associated Genes by Microarray Meta-Analysis and Building Predictive Models with Machine Learning Algorithms

被引:5
作者
Lu, Wei [1 ,2 ]
Pan, Xiang [1 ,2 ]
Dai, Siqi [1 ,2 ]
Fu, Dongliang [1 ,2 ]
Hwang, Maxwell [1 ,2 ]
Zhu, Yingshuang [1 ,2 ]
Zhang, Lina [1 ,2 ]
Wei, Jingsun [1 ,2 ]
Kong, Xiangxing [1 ,2 ]
Li, Jun [1 ,2 ]
Xiao, Qian [1 ,2 ]
Ding, Kefeng [1 ,2 ]
机构
[1] Zhejiang Univ, Sch Med, Minist Educ,Affiliated Hosp 2, Dept Colorectal Surg & Oncol,Key Lab Canc Prevent, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Canc Ctr, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
POOLED ANALYSIS; POOR-PROGNOSIS; RECTAL-CANCER; EXPRESSION; SURVIVAL; SIGNATURE; TUMOR; SURVEILLANCE; METASTASIS; BIOMARKER;
D O I
10.1155/2021/6657397
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background. Stage II colorectal cancer patients had heterogeneous prognosis, and patients with recurrent events had poor survival. In this study, we aimed to identify stage II colorectal cancer recurrence associated genes by microarray meta-analysis and build predictive models to stratify patients' recurrence-free survival. Methods. We searched the GEO database to retrieve eligible microarray datasets. The microarray meta-analysis was used to identify universal recurrence associated genes. Total samples were randomly divided into the training set and the test set. Two survival models (lasso Cox model and random survival forest model) were trained in the training set, and AUC values of the time-dependent receiver operating characteristic (ROC) curves were calculated. Survival analysis was performed to determine whether there was significant difference between the predicted high and low risk groups in the test set. Results. Six datasets containing 651 stage II colorectal cancer patients were included in this study. The microarray meta-analysis identified 479 recurrence associated genes. KEGG and GO enrichment analysis showed that G protein-coupled glutamate receptor binding and Hedgehog signaling were significantly enriched. AUC values of the lasso Cox model and the random survival forest model were 0.815 and 0.993 at 60 months, respectively. In addition, the random survival forest model demonstrated that the effects of gene expression on the recurrence-free survival probability were nonlinear. According to the risk scores computed by the random survival forest model, the high risk group had significantly higher recurrence risk than the low risk group (HR = 1.824, 95% CI: 1.079-3.084, p = 0.025). Conclusions. We identified 479 stage II colorectal cancer recurrence associated genes by microarray meta-analysis. The random survival forest model which was based on the recurrence associated gene signature could strongly predict the recurrence risk of stage II colorectal cancer patients.
引用
收藏
页数:13
相关论文
共 46 条
[31]   Key Issues in Conducting a Meta-Analysis of Gene Expression Microarray Datasets [J].
Ramasamy, Adaikalavan ;
Mondry, Adrian ;
Holmes, Chris C. ;
Altman, Douglas G. .
PLOS MEDICINE, 2008, 5 (09) :1320-1332
[32]  
Saito R, 2012, NAT METHODS, V9, P1069, DOI [10.1038/nmeth.2212, 10.1038/NMETH.2212]
[33]   Cancer statistics, 2020 [J].
Siegel, Rebecca L. ;
Miller, Kimberly D. ;
Jemal, Ahmedin .
CA-A CANCER JOURNAL FOR CLINICIANS, 2020, 70 (01) :7-30
[34]  
Team R.C., 2022, R: A language and environment for statistical computing
[35]  
Tibshirani R, 1997, STAT MED, V16, P385, DOI 10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO
[36]  
2-3
[37]   Meta- and Orthogonal Integration of Influenza "OMICs'' Data Defines a Role for UBR4 in Virus Budding [J].
Tripathi, Shashank ;
Pohl, Marie O. ;
Zhou, Yingyao ;
Rodriguez-Frandsen, Ariel ;
Wang, Guojun ;
Stein, David A. ;
Moulton, Hong M. ;
DeJesus, Paul ;
Che, Jianwei ;
Mulder, Lubbertus C. F. ;
Yangueez, Emilio ;
Andenmatten, Dario ;
Pache, Lars ;
Manicassamy, Balaji ;
Albrecht, Randy A. ;
Gonzalez, Maria G. ;
Nguyen, Quy ;
Brass, Abraham ;
Elledge, Stephen ;
White, Michael ;
Shapira, Sagi ;
Hacohen, Nir ;
Karlas, Alexander ;
Meyer, Thomas F. ;
Shales, Michael ;
Gatorano, Andre ;
Johnson, Jeffrey R. ;
Jang, Gwen ;
Johnson, Tasha ;
Verschueren, Erik ;
Sanders, Doug ;
Krogan, Nevan ;
Shaw, Megan ;
Koenig, Renate ;
Stertz, Silke ;
Garcia-Sastre, Adolfo ;
Chanda, Sumit K. .
CELL HOST & MICROBE, 2015, 18 (06) :723-735
[38]   Predictive Factors of Early Relapse in UICC Stage I-III Colorectal Cancer Patients After Curative Resection [J].
Tsai, Hsiang-Lin ;
Chu, Koung-Shing ;
Huang, Yu-Ho ;
Su, Yu-Chung ;
Wu, Jeng-Yih ;
Kuo, Chao-Hung ;
Chen, Chao-Wen ;
Wang, Jaw-Yuan .
JOURNAL OF SURGICAL ONCOLOGY, 2009, 100 (08) :736-743
[39]   Trends in incidence, treatment and survival of patients with stage IV colorectal cancer: a population-based series [J].
van der Pool, A. E. M. ;
Damhuis, R. A. ;
IJzermans, J. N. M. ;
de Wilt, J. H. W. ;
Eggermont, A. M. M. ;
Kranse, R. ;
Verhoef, C. .
COLORECTAL DISEASE, 2012, 14 (01) :56-61
[40]   Strong correlation between ASPM gene expression and HCV cirrhosis progression identified by co-expression analysis [J].
Wang, Fan ;
Chang, Ying ;
Li, Jin ;
Wang, Hongling ;
Zhou, Rui ;
Qi, Jian ;
Liu, Jing ;
Zhao, Qiu .
DIGESTIVE AND LIVER DISEASE, 2017, 49 (01) :70-76