Data Mining Techniques for Disease Risk Prediction Model: A Systematic Literature Review

被引:3
作者
Ahmad, Wan Muhamad Taufik Wan [1 ]
Ab Ghani, Nur Laila [1 ]
Drus, Sulfeeza Mohd [1 ]
机构
[1] Univ Tenaga Nas, Coll Comp Sci & Informat Technol, Jalan IKRAM UNITEN, Kajang 43000, Selangor, Malaysia
来源
RECENT TRENDS IN DATA SCIENCE AND SOFT COMPUTING, IRICT 2018 | 2019年 / 843卷
关键词
Disease risk prediction; Data mining; Data mining techniques; Data mining algorithm;
D O I
10.1007/978-3-319-99007-1_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Risk prediction model estimates event occurrence based on related data. Conventional statistical metrics that utilized primary data generates simple descriptive analysis that often provide insufficient knowledge for decision making. In contrast, data mining techniques that have the capability to find hidden pattern from the secondary data in large databases and create prediction for de-sired output has become a popular approach to develop any risk prediction model. In healthcare particularly, data mining techniques can be applied in disease risk prediction model to provide reliable prediction on the possibility of acquiring the disease based on individual's clinical and non-clinical data. Due to the increased use of data mining in healthcare, this study aims at identifying the data mining techniques and algorithms that are commonly implemented in studies related to various disease risk prediction model as well as finding the accuracy of the algorithms. The accuracy evaluation consists of various method, but this paper is focusing on overall accuracy which is measured by the total number of correctly predicted output over the total number of prediction. A systematic literature review approach that search across five databases found 170 articles, of which 7 articles were selected in the final process. This review found that most prediction model used classification technique, with a focus on decision tree, neural network, support vector machines, and Naive Bayes algorithms where heart-related disease is commonly studied. Further research can apply similar algorithms to develop risk prediction model for other types of diseases, such as infectious disease prediction.
引用
收藏
页码:40 / 46
页数:7
相关论文
共 17 条
  • [1] [Anonymous], 2013, P COMM NETW TECHN IC
  • [2] Dash SS, 2016, 2016 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC)
  • [3] Data-Mining-Based Coronary Heart Disease Risk Prediction Model Using Fuzzy Logic and Decision Tree
    Kim, Jaekwon
    Lee, Jongsik
    Lee, Youngho
    [J]. HEALTHCARE INFORMATICS RESEARCH, 2015, 21 (03) : 167 - 174
  • [4] Risk Prediction System based on Risk Prediction Model with Complex Event Processing Risk Prediction in Real Time on complex event processing engine
    Kim, Yoon-Ki
    Jeong, Chang-Sung
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 711 - 715
  • [5] Leopord H., 2016, Int. J. Eng. Sci, V5, P1
  • [6] An artificial neural network prediction model of congenital heart disease based on risk factors A hospital-based case-control study
    Li, Huixia
    Luo, Miyang
    Zheng, Jianfei
    Luo, Jiayou
    Zeng, Rong
    Feng, Na
    Du, Qiyun
    Fang, Junqun
    [J]. MEDICINE, 2017, 96 (06)
  • [7] Pak M., 2014, P 18 IEEE INT S CONS, P1, DOI [10.1109/ISCE.2014.6884338, DOI 10.1109/ISCE.2014.6884338]
  • [8] Patel S., 2016, INT J INF SCI TECHNO, V62, P1
  • [9] Radha P., 2014, INT J ADV RES COMPUT, V2, P2321
  • [10] Sagar P., 2017, INT J COMPUT APPL, V163, P975