Prediction of risk factors for synchronous colorectal cancer in patients with colorectal cancer

被引:0
作者
Chin C. [1 ]
Ting W.-C. [2 ,3 ]
Chang C.-C. [4 ,5 ]
Zhang Y.-X. [4 ]
机构
[1] Internal Medical, Antai Medical Care Coporation Antai Tian-Sheng Memorial Hospital
[2] Division of Colorectal Surgery, Chung Shan Medical University Hospital
[3] School of Medicine, Chung Shan Medical University
[4] School of Medical Informatics, Chung Shan Medical University
[5] IT Office, Chung Shan Medical University Hospital
来源
Journal of Quality | 2020年 / 27卷 / 04期
关键词
Colorectal cancer; Machine learning techniques; Synchronous;
D O I
10.6220/joq.202008_27(4).0002
中图分类号
学科分类号
摘要
Screening for cancer and advanced treatments have not only improved treatment outcomes and patient survival rates but also led to an increase in the number of diagnosed synchronous colorectal cancer (SCC) cases. This study used machine learning techniques to develop a predictive model including seven classification techniques (naive Bayes, logistic regression, K-Star, random committee, randomizable filtered classifier, random forests, and random tree) to identify the risk factors and clinical features of SCC. The clinical dataset comprised a total of 4,287 valid records and was obtained from three cancer registries. Fourteen independent variables were selected as risk factors to analyze the characteristics of SCC. Seven classification techniques were tested in this study by using Waikato software. Performance indicators were analyzed in terms of sensitivity, accuracy, specificity, F-measure score, and precision. The results of this study revealed that the most important risk factors of SCC are combined stage, tumor size, chemotherapy, and grade/differentiation. Among the classification techniques tested, the naive Bayes method revealed the highest accuracy (90.03%). Finally, chemotherapy for SCC is an important factor for future observation. © 2020, Chinese Society for Quality. All rights reserved.
引用
收藏
页码:231 / 245
页数:14
相关论文
共 32 条
[1]  
Amrehn M., Mualla F., Angelopoulou E., Steidl S., Maier A., The Random Forest Classifier in WEKA: Discussion and New Developments for Imbalanced Data, (2018)
[2]  
Asaju L.B., Shola P.B., Franklin N., Abiola H.M., Intrusion detection system on a computer network using an ensemble of randomizable filtered classifier, k-nearest neighbor algorithm, FUW Trends in Science & Technology Journal, 2, 1B, pp. 550-553, (2017)
[3]  
Chang C.-C., Chen S.-H., Developing a novel machine learning-based classification scheme for predicting SPCs in breast cancer survivors, Frontiers in Genetics, 10, (2019)
[4]  
Cleary J.G., Trigg L.E., K*: An instance-based learner using and entropic distance measure, Proceedings of the 12Th International Conference on Machine Learning, (1995)
[5]  
Corkum M., Hayden J.A., Kephart G., Urquhart R., Schlievert C., Porter G., Screening for new primary cancers in cancer survivors compared to non-cancer controls: A systematic review and meta-analysis, Journal of Cancer Survivorship, 7, 3, pp. 455-463, (2013)
[6]  
Cui S., Zhao L., Wang Y., Dong Q., Ma J., Wang Y., Et al., Using naive Bayes classifier to predict osteonecrosis of the femoral head with cannulated screw fixation, Injury, 49, 10, pp. 1865-1870, (2018)
[7]  
Giardiello F.M., Allen J.I., Axilbund J.E., Boland C.R., Burke C.A., Burt R.W., Et al., Guidelines on genetic evaluation and management of Lynch syndrome: A consensus statement by the US multi-society task force on colorectal cancer, Gastroenterology, 147, 2, pp. 502-526, (2014)
[8]  
Guinness R.E., Beyond where to how: A machine learning approach for sensing mobility contexts using smartphone sensors, Sensors, 15, 5, pp. 9962-9985, (2015)
[9]  
Huang J.-H., Yan J., Wu Q.-H., Ferro M.D., Yi L.-Z., Lu H.-M., Et al., Selective of informative metabolites using random forests based on model population analysis, Talanta, 117, pp. 549-555, (2013)
[10]  
Kalmegh S.R., Comparative analysis of WEKA data mining algorithm random forest, random tree and LAD tree for classification of indigenous news data, International Journal of Emerging Technology and Advanced Engineering, 5, 1, pp. 507-517, (2015)