Using data mining techniques for multi-diseases prediction modeling of hypertension and hyperlipidemia by common risk factors

被引:65
作者
Chang, Cheng-Ding [2 ]
Wang, Chien-Chih [1 ]
Jiang, Bernard C. [2 ]
机构
[1] Ming Chi Univ Technol, Dept Ind Engn & Management, Taishan 243, Taipei County, Taiwan
[2] Yuan Zee Univ, Dept Ind Engn & Management, Chungli 320, Taiwan
关键词
Health evaluation center; Cardiovascular disease; Multi-feature selection; MARS; BLOOD-PRESSURE;
D O I
10.1016/j.eswa.2010.10.086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many previous studies have employed predictive models for a specific disease, but fail to note that humans often suffer from not only one disease, but associated diseases as well. Because these associated multiple diseases might have reciprocal effects, and abnormalities in physiological indicators can indicate multiple associated diseases, common risk factors can be used to predict the multiple associated diseases. This approach provides a more effective and comprehensive forecasting mechanism for preventive medicine. This paper proposes a two-phase analysis procedure to simultaneously predict hypertension and hyperlipidemia. Firstly, we used six data mining approaches to select the individual risk factors of these two diseases, and then determined the common risk factors using the voting principle. Next, we used the Multivariate Adaptive Regression Splines (MARS) method to construct a multiple predictive model for hypertension and hyperlipidemia. This study uses data from a physical examination center database in Taiwan that includes 2048 subjects. The proposed analysis procedure shows that the common risk factors of hypertension and hyperlipidemia are Systolic Blood Pressure (SBP), Triglycerides, Uric Acid (UA), Glutamate Pyruvate Transaminase (GPT), and gender. The proposed multi-diseases predictor method has a classification accuracy rate of 93.07%. The results of this paper provide an effective and appropriate methodology for simultaneously predicting hypertension and hyperlipidemia. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5507 / 5513
页数:7
相关论文
共 16 条
[1]   Determination of risk factors for hypertension through the classification tree method [J].
Akdag, Reyza ;
Fenkci, Semin ;
Degirmencioglu, Serkan ;
Rota, Simin ;
Sermez, Yurdaer ;
Camdeviren, Handan .
ADVANCES IN THERAPY, 2006, 23 (06) :885-892
[2]  
*AM COLL PHYS, 2007, HYP HYP ACP DIAB CAR
[3]  
[Anonymous], 2013, WORLD HLTH REPORT 20
[4]  
Armengol E, 2001, METHOD INFORM MED, V40, P46
[5]   ASSOCIATION BETWEEN BLOOD-PRESSURE AND SERUM-LIPIDS IN A POPULATION - THE TROMSO STUDY [J].
BONAA, KH ;
THELLE, DS .
CIRCULATION, 1991, 83 (04) :1305-1314
[6]   SMOOTHING NOISY DATA WITH SPLINE FUNCTIONS [J].
WAHBA, G .
NUMERISCHE MATHEMATIK, 1975, 24 (05) :383-393
[7]   MODELING OF TOPOGRAPHIC EFFECTS ON ANTARCTIC SEA-ICE USING MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
DEVEAUX, RD ;
GORDON, AL ;
COMISO, JC ;
BACHERER, NE .
JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 1993, 98 (C11) :20307-20319
[8]  
Friedman J H, 1995, Stat Methods Med Res, V4, P197, DOI 10.1177/096228029500400303
[9]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[10]   Statistical techniques for the classification of chromites in diamond exploration samples [J].
Griffin, WL ;
Fisher, NI ;
Friedman, JH ;
Ryan, CG .
JOURNAL OF GEOCHEMICAL EXPLORATION, 1997, 59 (03) :233-249