A survey on data mining techniques used in medicine

被引:14
作者
Birjandi, Saba Maleki [1 ]
Khasteh, Seyed Hossein [1 ,2 ]
机构
[1] KN Toosi Univ Technol, Sch Comp Engn, Tehran 1631714191, Iran
[2] Fac Comp Engn, Shariati Ave, Tehran, Iran
关键词
Data mining; Statistical methods; Decision tree; Linear regression; Association rules; Medical data mining; BODY-MASS INDEX; ASSOCIATION RULE; HEART-DISEASE; RISK-FACTORS; PREDICTION; CLASSIFICATION; IDENTIFICATION; ALGORITHM; MORTALITY; CHILDREN;
D O I
10.1007/s40200-021-00884-2
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Data mining is the process of analyzing a massive amount of data to identify meaningful patterns and detect relations, which can lead to future trend prediction and appropriate decision making. Data mining applications are significant in marketing, banking, medicine, etc. In this paper, we present an overview of data mining applications in medicine to provide a clear view of the challenges and previous works in this area for researchers. Data mining techniques such as Decision Tree, Random Forest, K-means Clustering, Support Vector Machine, Logistic Regression, Neural Network, Naive Bayes, and association rule mining are used for diagnosing, prognosis, classifying, constructing predictive models, and analyzing risk factors of various diseases. The main objective of the paper is to analyze and compare different data mining techniques used in the medical applications. We present a summary of the results and provide comparison analysis of the data mining methods employed by the reviewed articles.
引用
收藏
页码:2055 / 2071
页数:17
相关论文
共 66 条
[61]   Detection of drug-drug interactions through data mining studies using clinical sources, scientific literature and social media [J].
Vilar, Santiago ;
Friedman, Carol ;
Hripcsak, George .
BRIEFINGS IN BIOINFORMATICS, 2018, 19 (05) :863-877
[62]   A support vector machine-based ensemble algorithm for breast cancer diagnosis [J].
Wang, Haifeng ;
Zheng, Bichen ;
Yoon, Sang Won ;
Ko, Hoo Sang .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 267 (02) :687-699
[63]   Machine Learning-Based Method for Obesity Risk Evaluation Using Single-Nucleotide Polymorphisms Derived from Next-Generation Sequencing [J].
Wang, Hsin-Yao ;
Chang, Shih-Cheng ;
Lin, Wan-Ying ;
Chen, Chun-Hsien ;
Chiang, Szu-Hsien ;
Huang, Kai-Yao ;
Chu, Bo-Yu ;
Lu, Jang-Jih ;
Lee, Tzong-Yi .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (12) :1347-1360
[64]   DATA MINING FOR THE IDENTIFICATION OF METABOLIC SYNDROME STATUS [J].
Worachartcheewan, Apilak ;
Schaduangrat, Nalini ;
Prachayasittikul, Virapong ;
Nantasenamat, Chanin .
EXCLI JOURNAL, 2018, 17 :72-88
[65]   A Deep Learning Mammography-based Model for Improved Breast Cancer Risk Prediction [J].
Yala, Adam ;
Lehman, Constance ;
Schuster, Tal ;
Portnoi, Tally ;
Barzilay, Regina .
RADIOLOGY, 2019, 292 (01) :60-66
[66]   Use of Data Mining to Reveal Body Mass Index (BMI): Patterns Among Pennsylvania Schoolchildren, Pre-K to Grade 12 [J].
YoussefAgha, Ahmed H. ;
Lohrmann, David K. ;
Jayawardene, Wasantha P. .
JOURNAL OF SCHOOL HEALTH, 2013, 83 (02) :85-92