Discovering Decision Tree Based Diabetes Prediction Model

被引:0
作者
Han, Jianchao [1 ]
Rodriguez, Juan C. [1 ]
Beheshti, Mohsen [1 ]
机构
[1] Calif State Univ Dominguez Hills, Dept Comp Sci, Carson, CA 90747 USA
来源
ADVANCES IN SOFTWARE ENGINEERING | 2009年 / 30卷
关键词
Decision tree; data mining; prediction model; bioinformatics and biomedicine; diabetes;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Data mining techniques have been extensively applied in bioinformatics to analyze biomedical data. In this paper, we choose the Rapid-I's RapidMiner as our tool to discover decision tree based diabetes prediction model from a Pima Indians Diabetes Data Set, which collects the information of patients with and without developing diabetes. Following the data mining process, our discussion will focus on the data preprocessing, including attribute identification and selection, outlier removal, data normalization and numerical discretization, visual data analysis, hidden relationships discovery, and a diabetes prediction model construction.
引用
收藏
页码:99 / 109
页数:11
相关论文
共 13 条
[1]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[2]  
Asuncion A., 2007, PIMA INDIANS DIABETE
[3]  
Cios K., 2007, Data Mining A Knowledge Discovery
[4]  
Kantardzic M., 2002, DATA MINING CONCEPTS
[5]  
Kass G. V., 1980, J R Stat Soc Ser C Appl Stat., V29, P119, DOI DOI 10.2307/2986296
[6]  
Larose D.T., 2006, DATA MINING METHODS
[7]  
Pyle D., 1999, Data Preparation for Data Mining
[8]  
Quinlan Ross., 1992, C4.5: Programs for Machine Learning
[9]  
*RAP I, 2008, INT DES
[10]  
Seibel J. A., 2007, DIABETES GUIDE