Machine Learning for Colorectal Cancer Risk Prediction

被引:4
作者
Zheng, Ling [1 ]
Eniola, Elijah [1 ]
Wang, Jiacun [1 ]
机构
[1] Monmouth Univ, Comp Sci & Software Engn Dept, W Long Branch, NJ 07764 USA
来源
2021 INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SOCIAL INTELLIGENCE (ICCSI) | 2021年
关键词
machine learning; colorectal cancer; risk prediction;
D O I
10.1109/ICCSI53130.2021.9736248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Colorectal cancer is the third most prevalent cancer and the second most common cause of cancer deaths in the United States. Screening is one of the most powerful tools for colorectal cancer prevention. Current screening recommendations are only based on history of colorectal cancer and age. To facilitate a more effective screening of colorectal cancer, this paper explores the feasibility of machine learning algorithms for the colorectal cancer risk prediction. The longitudinal Pancreatic, Lung, Colorectal, Ovarian Cancer dataset from the National Cancer Institute was utilized for the training and testing of eight machine learning algorithms. The experiment results show that the gradient boosting model has the largest area under the Receiver Operating Characteristics curve 0.82, and the random forest model has the highest accuracy 0.75, highest recall 0.76 and highest F1 score 0.75. The two optimal models were also used to evaluate the importance of top risk factors, which are helpful for a more effective screening recommendation.
引用
收藏
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2021, COLORECTAL CANC RISK
[2]  
[Anonymous], 2021, PLCO
[3]  
[Anonymous], 2021, TRIAL SUMMARY
[4]  
[Anonymous], 2021, MAIN QUESTIONNAIRES
[5]  
[Anonymous], 2021, CAN COLORECTAL POLYP
[6]   Use of colonoscopy as a primary screening test for colorectal cancer in average risk people [J].
Betés, M ;
Muñoz-Navas, MA ;
Duque, JM ;
Angós, R ;
Macías, E ;
Súbtil, JC ;
Herraiz, M ;
De La Riva, S ;
Delgado-Rodríguez, M ;
Martínez-González, MA .
AMERICAN JOURNAL OF GASTROENTEROLOGY, 2003, 98 (12) :2648-2654
[7]  
Bishop C.M., 2006, Pattern Recognition and Machine Learning, DOI DOI 10.1007/978-0-387-45528-0
[8]  
cancer, American Cancer Society Guideline for Diet and Physical Activity
[9]   Harvard report on cancer prevention volume 4: Harvard Cancer Risk Index [J].
Colditz, GA ;
Atwood, KA ;
Emmons, K ;
Monson, RR ;
Willett, WC ;
Trichopoulos, D ;
Hunter, DJ .
CANCER CAUSES & CONTROL, 2000, 11 (06) :477-488
[10]  
Du Y., 2019, IEEE INT C NETWORK S