Class noise and supervised learning in medical domains: The effect of feature extraction

被引:73
作者
Pechenizkiy, Mykola [1 ]
Tsymbal, Alexey [2 ]
Puuronen, Seppo [3 ]
Pechenizkiy, Oleksandr [3 ]
机构
[1] Univ Jyvaskyla, Dept Math, Jyvaskyla, Finland
[2] Trinity Coll Dublin, Dept CS, Dublin, Ireland
[3] Univ Jyvaskyla, Dept CS & ISs, Jyvaskyla, Finland
来源
19TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS | 2006年
基金
芬兰科学院; 爱尔兰科学基金会;
关键词
D O I
10.1109/CBMS.2006.65
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inductive learning systems have been successfully applied in a number of medical domains. It is generally accepted that the highest accuracy results that an inductive learning system can achieve depend on the quality of data and on the appropriate selection of a learning algorithm for the data. In this paper we analyze the effect of class noise on supervised learning in medical domains. We review the related work on learning from noisy data and propose to use feature extraction as a pre-processing step to diminish the effect of class noise on the learning process. Our experiments with 8 medical datasets show that feature extraction indeed helps to deal with class noise. It clearly results in higher classification accuracy of learnt models without the separate explicit elimination of noisy instances.
引用
收藏
页码:708 / +
页数:2
相关论文
共 23 条
[1]   Identifying mislabeled training data [J].
Brodley, CE ;
Friedl, MA .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 :131-167
[2]  
FAYYAD U, 1997, ADV KNOWLEDGE DISCOV
[3]  
Fukunaga K, 1999, INTRO STAT PATTERN R
[4]  
Gamberger D, 1999, MACHINE LEARNING, PROCEEDINGS, P143
[5]   Noise detection and elimination in data preprocessing: Experiments in medical domains [J].
Gamberger, D ;
Lavrac, N ;
Dzeroski, S .
APPLIED ARTIFICIAL INTELLIGENCE, 2000, 14 (02) :205-223
[6]  
GAMBERGER D, 1996, P 7 INT WORKSH ALG L, P199
[7]  
John G. H., 1995, P 1 INT C KNOWL DISC, P174
[8]  
Jollife I., 1986, Principal Component Analysis
[9]   INDUCTIVE AND BAYESIAN LEARNING IN MEDICAL DIAGNOSIS [J].
KONONENKO, I .
APPLIED ARTIFICIAL INTELLIGENCE, 1993, 7 (04) :317-337
[10]  
Liu H., 1998, FEATURE EXTRACTION C, DOI 10.1007/978-1-4615-5725-8