An improved feature extraction approach based on rough sets for the medical diagnosis

被引:15
作者
Jiang, Wei [1 ]
Li, Yi-Jun [1 ]
Pang, Xiu-Li [1 ]
机构
[1] Harbin Inst Technol, Informat Management Res Ctr, Harbin 150001, Peoples R China
来源
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2008年
关键词
medical diagnosis; rough sets; maximum entropy model; support vector machine; feature extraction;
D O I
10.1109/ICMLC.2008.4620436
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel approach based on Rough Sets to extract the complicated features from the medical diagnosis corpus. Some symptoms or basic features in the medical diagnosis are usually correlated. In general, the combinations of several basic symptoms may represent the disease more precision. However, the overmuch feature can reduce the generalization ability, or even many unfit features as the noise can decrease the model's performance. This paper proposes to apply the rough set theory to mine the complicated features, even from noise or inconsistent corpus. Secondly, these complex features are added into the Maximum Entropy model or Support Vector Machine etc. as a new kind of features, consequently, the feature weights can be assigned according to the performance of the whole model. The experiments in the Liver-disorders repository show that our method can improve the Maximum Entropy model by the precision 3.51%, improve the Support Vector Machine model by the precision 3.05%, improve the Naive Bayes model by the precision 3.59%, and improve the Bayes and GoodTuring model by the precision 3.59%.
引用
收藏
页码:385 / 390
页数:6
相关论文
共 13 条
[1]  
Carlin U, 1998, P 7 C INF PROC MAN U, P1528
[2]  
CASTRO F, 1999, MED DECIS MAKING, V19, P178
[3]  
GEORGEPETER K, 2001, IEEE T INF TECHNOL B, V5, P55
[4]  
Pawlak Z., 1991, SYSTEM THEORY KNOWLE, V9
[5]   Towards more optimal medical diagnosing with evolutionary algorithms [J].
Podgorelec V. .
Journal of Medical Systems, 2001, 25 (3) :195-219
[6]  
Pople H.E., 1982, ARTIF INTELL MED, P119
[7]   Finding the optimal multiple-test strategy using a method analogous to logistic regression: The diagnosis of hepatolenticular degeneration (Wilson's disease) [J].
Richards, RJ ;
Hammitt, JK ;
Tsevat, J .
MEDICAL DECISION MAKING, 1996, 16 (04) :367-375
[8]  
Shortliffe EH, 2000, MED INFORM COMPUTER
[9]  
SLOWINSKI K, 1992, HDB APPL ADV ROUGH S, V11, P77
[10]   Diagnose progressive encephalopathy applying the rough set theory [J].
WakuliczDeja, A ;
Paszek, P .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 1997, 46 (02) :119-127