A reliable method for colorectal cancer prediction based on feature selection and support vector machine

被引:0
作者
Dandan Zhao
Hong Liu
Yuanjie Zheng
Yanlin He
Dianjie Lu
Chen Lyu
机构
[1] School of Information Science and Engineering,Shandong Normal University
[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology,undefined
来源
Medical & Biological Engineering & Computing | 2019年 / 57卷
关键词
Colorectal cancer; Logistic regression; Support vector machine; Microbiome;
D O I
暂无
中图分类号
学科分类号
摘要
Colorectal cancer (CRC) is a common cancer responsible for approximately 600,000 deaths per year worldwide. Thus, it is very important to find the related factors and detect the cancer accurately. However, timely and accurate prediction of the disease is challenging. In this study, we build an integrated model based on logistic regression (LR) and support vector machine (SVM) to classify the CRC into cancer and normal samples. From various factors, human location, age, gender, BMI, and cancer tumor type, tumor grade, and DNA, of the cancer, we select the most significant factors (p < 0.05) using logistic regression as main features, and with these features, a grid-search SVM model is designed using different kernel types (Linear, radial basis function (RBF), Sigmoid, and Polynomial). The result of the logistic regression indicates that the Firmicutes (AUC 0.918), Bacteroidetes (AUC 0.856), body mass index (BMI) (AUC 0.777), and age (AUC 0.710) and their combined factors (AUC 0.942) are effective for CRC detection. And the best kernel type is RBF, which achieves an accuracy of 90.1% when k = 5, and 91.2% when k = 10. This study provides a new method for colorectal cancer prediction based on independent risky factors.
引用
收藏
页码:901 / 912
页数:11
相关论文
共 208 条
[1]  
Zadeh SA(2017)A novel and reliable computational intelligence system for breast cancer detection Germ J Med Biol Eng Comp 9 1-12
[2]  
Sj SMC(2015)Identifying relevant group of miRNAs in cancer using fuzzy mutual information Germ J Medical & Biological Engineering & Computing 54 701-710
[3]  
Mohammadi Z(2010)Primary prevention of colorectal cancer J Gastroenterol 138 2029-2043
[4]  
Pal JK(2010)Innate immune mechanisms of colitis and colitis-associated colorectal cancer N Eng J Nature Rev Immunol 11 9-20
[5]  
Ray SS(2016)Gut microbiota, inflammation, and colorectal cancer US J Ann Rev Microbiol 70 395-411
[6]  
Pal SK(2014)Network-based prediction of cancer under genetic storm J Cancer Inform 13 15-31
[7]  
Chan AT(2015)A colorectal cancer prediction model using traditional and genetic risk scores in Koreans N Eng J BMC Genet 16 1-7
[8]  
Giovannucci EL(2016)Development and external validation of a fecal immunochemical test-based prediction model for colorectal cancer detection in symptomatic patients J BMC Med 14 128-140
[9]  
Saleh M(2015)Application of artificial neural networks to link genetic and environmental factors to DNA methylation in colorectal cancer N Eng J Epigenomics 7 175-186
[10]  
Trinchieri G(2015)Role of blood tumor markers in predicting metastasis and local recurrence after curative resection of colon cancer J Int J Clin Exp Med 8 982-990