Risk assessment of coronary heart disease based on cloud-random forest

Cited by: 51
Authors
Wang, Jing [1]
Rao, Congjun [1]
Goh, Mark [2,3]
Xiao, Xinping [1]
Affiliations
[1] Wuhan Univ Technol, Sch Sci, Wuhan 430070, Peoples R China
[2] Natl Univ Singapore, NUS Business Sch, Singapore 119623, Singapore
[3] Natl Univ Singapore, Logist Inst Asia Pacific, Singapore 119623, Singapore
Funding
National Natural Science Foundation of China
Keywords
CHD; Risk assessment; CART; Cloud model; C-RF; FEATURE-SELECTION; LEARNING APPROACH; CLASSIFICATION; DIAGNOSIS; CANCER; CARE;
DOI
10.1007/s10462-022-10170-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Coronary heart disease (CHD) is a major public health problem that affects a nation's economic and social development. Timely assessment of CHD risk helps to stop, reverse, and reduce the spread of many chronic diseases and health hazards. This paper proposes a cloud-random forest (C-RF) model, which combines the cloud model with random forest, to assess the risk of CHD. In this model, starting from the traditional classification and regression tree (CART), a weight-determination algorithm based on the cloud model and the decision-making trial and evaluation laboratory (DEMATEL) method is applied to obtain the weights of the evaluation attributes. For each attribute, the attribute weight and the gain value of the smallest Gini coefficient are combined in a weighted sum, which replaces the original gain value. This rule serves as a new CART node-splitting criterion for constructing new decision trees, which together form a new random forest, namely the C-RF. The Framingham dataset from the Kaggle platform is the research sample for the empirical analysis. The C-RF model is compared with CART, support vector machine (SVM), convolutional neural network (CNN), and random forest (RF) using standard performance evaluation indexes such as accuracy, error rates, the ROC curve, and the AUC value. The results show that the classification accuracy of the C-RF model is 85%, an improvement of 8, 9, 4, and 3% over CART, SVM, CNN, and RF, respectively. The type I error rate is 13.99%, which is 6.99, 7.44, 4.47, and 3.02% lower than that of CART, SVM, CNN, and RF, respectively. The AUC value is 0.85, which is also higher than that of the other comparison models. Thus, the C-RF model offers better classification performance in the risk assessment of CHD.
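The split rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the weighted sum is formed, so the mixing coefficient `alpha`, the function names, and the toy attribute weights below are all assumptions made for illustration. In the paper the weights come from the cloud model and DEMATEL; here they are simply supplied as numbers.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_gini_gain(values, labels):
    """Largest impurity reduction over all binary threshold splits of one
    attribute, i.e. the gain of the split with the smallest weighted Gini."""
    parent = gini(labels)
    n = len(labels)
    best = 0.0
    for t in sorted(set(values))[:-1]:
        left = [y for x, y in zip(values, labels) if x <= t]
        right = [y for x, y in zip(values, labels) if x > t]
        child = (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, parent - child)
    return best


def choose_split_attribute(columns, labels, weights, alpha=0.5):
    """Score each attribute by a weighted sum of its (externally supplied)
    weight and its best Gini gain, then pick the highest score.

    ASSUMPTION: score = alpha * weight + (1 - alpha) * gain. The exact
    combination used in the paper is not given in the abstract.
    """
    scores = {
        name: alpha * weights[name] + (1.0 - alpha) * best_gini_gain(vals, labels)
        for name, vals in columns.items()
    }
    return max(scores, key=scores.get), scores


# Toy data (hypothetical, not from the Framingham dataset).
columns = {"age": [40, 50, 60, 70], "bmi": [22, 30, 24, 31]}
labels = [0, 0, 1, 1]
weights = {"age": 0.1, "bmi": 0.9}  # hypothetical attribute weights

plain_choice, _ = choose_split_attribute(columns, labels, weights, alpha=0.0)
weighted_choice, _ = choose_split_attribute(columns, labels, weights, alpha=0.5)
print(plain_choice, weighted_choice)
```

With `alpha=0.0` the rule reduces to the ordinary CART choice by Gini gain alone; with a nonzero `alpha`, a heavily weighted attribute can win the split even when its raw gain is smaller, which is the effect the abstract attributes to the new criterion.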
Pages: 203-232
Number of pages: 30