Improved concept drift handling in surgery prediction and other applications

被引:15
作者
Beyene, Ayne A. [1 ]
Welemariam, Tewelle [1 ]
Persson, Marie [1 ]
Lavesson, Niklas [1 ]
机构
[1] Blekinge Inst Technol, Dept Comp Sci & Engn, S-37179 Karlskrona, Sweden
关键词
Concept drift; Surgery prediction; Trigger-based ensemble; CLASSIFIERS;
D O I
10.1007/s10115-014-0756-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The article presents a new algorithm for handling concept drift: the Trigger-based Ensemble (TBE) is designed to handle concept drift in surgery prediction but it is shown to perform well for other classification problems as well. At the primary care, queries about the need for surgical treatment are referred to a surgeon specialist. At the secondary care, referrals are reviewed by a team of specialists. The possible outcomes of this review are that the referral: (i) is canceled, (ii) needs to be complemented, or (iii) is predicted to lead to surgery. In the third case, the referred patient is scheduled for an appointment with a surgeon specialist. This article focuses on the binary prediction of case three (surgery prediction). The guidelines for the referral and the review of the referral are changed due to, e.g., scientific developments and clinical practices. Existing decision support is based on the expert systems approach, which usually requires manual updates when changes in clinical practice occur. In order to automatically revise decision rules, the occurrence of concept drift (CD) must be detected and handled. The existing CD handling techniques are often specialized; it is challenging to develop a more generic technique that performs well regardless of CD type. Experiments are conducted to measure the impact of CD on prediction performance and to reduce CD impact. The experiments evaluate and compare TBE to three existing CD handling methods (AWE, Active Classifier, and Learn++) on one real-world dataset and one artificial dataset. TBA significantly outperforms the other algorithms on both datasets but is less accurate on noisy synthetic variations of the real-world dataset.
引用
收藏
页码:177 / 196
页数:20
相关论文
共 27 条
[1]  
Alippi C, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P1675, DOI 10.1109/IJCNN.2011.6033426
[2]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[3]  
[Anonymous], 2004, COMPUTER SCI DEP TRI
[4]  
Baena-Garcia M, 2006, 4 INT WORKSH KNOWL D, V6, P77
[5]  
Chengguan Xiang, 2009, Proceedings of the 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009), P100, DOI 10.1109/FSKD.2009.245
[6]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[7]   Incremental Learning of Concept Drift in Nonstationary Environments [J].
Elwell, Ryan ;
Polikar, Robi .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (10) :1517-1531
[8]  
Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
[9]  
Magoulas G. D., 2001, Machine learning and its applications. Advanced lectures, P300
[10]   Novelty detection: a review - part 1: statistical approaches [J].
Markou, M ;
Singh, S .
SIGNAL PROCESSING, 2003, 83 (12) :2481-2497