Risk assessment of coronary heart disease based on cloud-random forest

被引:45
|
作者
Wang, Jing [1 ]
Rao, Congjun [1 ]
Goh, Mark [2 ,3 ]
Xiao, Xinping [1 ]
机构
[1] Wuhan Univ Technol, Sch Sci, Wuhan 430070, Peoples R China
[2] Natl Univ Singapore, NUS Business Sch, Singapore 119623, Singapore
[3] Natl Univ Singapore, Logist Inst Asia Pacific, Singapore 119623, Singapore
基金
中国国家自然科学基金;
关键词
CHD; Risk assessment; CART; Cloud model; C-RF; FEATURE-SELECTION; LEARNING APPROACH; CLASSIFICATION; DIAGNOSIS; CANCER; CARE;
D O I
10.1007/s10462-022-10170-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coronary heart disease (CHD) is a major public health problem affecting a nation's economic and social development. Risk assessing CHD in a timely manner helps to stop, reverse, and reduce the spread of many chronic diseases and health hazards. This paper proposes a cloud-random forest (C-RF) model combining cloud model and random forest to assess the risk of CHD. In this model, based on the traditional classification and regression trees (CART), a weight determining algorithm based on the cloud model and decision-making trial and evaluation laboratory is applied to obtain the weights of the evaluation attributes. The attribute weight and the gain value of the smallest Gini coefficient corresponding to the same attribute are weighted and summed. The weighted sum is then used to replace the original gain value. This value rule is used as a new CART node split criterion to construct a new decision tree, thus forming a new random forest, namely, the C-RF. The Framingham dataset of the Kaggle platform is the research sample for the empirical analysis. Comparing the C-RF model with CART, support vector machine (SVM), convolutional neural network (CNN), and random forest (RF) using standard performance evaluation indexes such as accuracy, error rates, ROC curve and AUC value. The result shows that the classification accuracy of the C-RF model is 85%, which is improved by 8, 9, 4 and 3% respectively compared with CART, SVM, CNN and RF. The error rate of the first type is 13.99%, which is 6.99, 7.44, 4.47 and 3.02% lower than CART, SVM, CNN and RF respectively. The AUC value is 0.85, which is also higher than other comparison models. Thus, the C-RF model is more superior on classification performance and classification effect in the risk assessment of CHD.
引用
收藏
页码:203 / 232
页数:30
相关论文
共 50 条
  • [1] Risk assessment of coronary heart disease based on cloud-random forest
    Jing Wang
    Congjun Rao
    Mark Goh
    Xinping Xiao
    Artificial Intelligence Review, 2023, 56 : 203 - 232
  • [2] Random Forest Ensemble Classifier to Predict the Coronary Heart Disease Using Risk Factors
    Ani, R.
    Augustine, Aneesh
    Akhil, N. C.
    Deepa, O. S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING SYSTEMS, ICSCS 2015, VOL 1, 2016, 397 : 701 - 710
  • [3] Flood hazard risk assessment model based on random forest
    Wang, Zhaoli
    Lai, Chengguang
    Chen, Xiaohong
    Yang, Bing
    Zhao, Shiwei
    Bai, Xiaoyan
    JOURNAL OF HYDROLOGY, 2015, 527 : 1130 - 1141
  • [4] Automated prediction of Coronary Artery Disease using Random Forest and Naive Bayes
    Alotaibi, Sarah Saud
    Almajid, Yasmeen Ahmed
    Alsahali, Samar Fahad
    Asalam, Nida
    Alotaibi, Maha Dhawi
    Ullah, Irfan
    Altabee, Rahaf Mohammed
    ICACSIS 2020: 2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2020, : 109 - 113
  • [5] Using a machine learning-based risk prediction model to analyze the coronary artery calcification score and predict coronary heart disease and risk assessment
    Huang, Yue
    Ren, YingBo
    Yang, Hai
    Ding, YiJie
    Liu, Yan
    Yang, YunChun
    Mao, AnQiong
    Yang, Tan
    Wang, YingZi
    Xiao, Feng
    He, QiZhou
    Zhang, Ying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [6] An Intelligent Learning System Based on Random Search Algorithm and Optimized Random Forest Model for Improved Heart Disease Detection
    Javeed, Ashir
    Zhou, Shijie
    Liao Yongjian
    Qasim, Iqbal
    Noor, Adeeb
    Nour, Redhwan
    IEEE ACCESS, 2019, 7 (180235-180243) : 180235 - 180243
  • [7] Coronary heart disease: women's assessment of risk - a qualitative study
    Ruston, A
    Clayton, J
    HEALTH RISK & SOCIETY, 2002, 4 (02) : 125 - 137
  • [8] Application Research on Risk Assessment of Municipal Pipeline Network Based on Random Forest Machine Learning Algorithm
    Cen, Hang
    Huang, Delong
    Liu, Qiang
    Zong, Zhongling
    Tang, Aiping
    WATER, 2023, 15 (10)
  • [9] Online supply chain financial risk assessment based on improved random forest
    Hao Zhang
    Yuxin Shi
    Jiayu Tong
    Journal of Data, Information and Management, 2021, 3 (1): : 41 - 48
  • [10] Landslide Risk Assessment Using a Combined Approach Based on InSAR and Random Forest
    Liu, Wangcai
    Zhang, Yi
    Liang, Yiwen
    Sun, Pingping
    Li, Yuanxi
    Su, Xiaojun
    Wang, Aijie
    Meng, Xingmin
    REMOTE SENSING, 2022, 14 (09)