A one-class classification decision tree based on kernel density estimation

被引:29
作者
Itani, Sarah [1 ,2 ]
Lecron, Fabian [3 ]
Fortemps, Philippe [3 ]
机构
[1] Fund Sci Res FNRS FRS FNRS, Brussels, Belgium
[2] Univ Mons, Fac Engn, Dept Math & Operat Res, Mons, Belgium
[3] Univ Mons, Fac Engn, Dept Engn Innovat Management, Mons, Belgium
关键词
One-class classification; Decision trees; Kernel density estimation; Explainable artificial intelligence; SUPPORT; VARIABILITY; ENSEMBLES;
D O I
10.1016/j.asoc.2020.106250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-class Classification (OCC) is an important field of machine learning which aims at predicting a single class on the basis of its lonely representatives and potentially some additional counter-examples. OCC is thus opposed to traditional classification problems involving two or more classes, and addresses the issue of class unbalance. There is a wide range of one-class models which give satisfaction in terms of performance. But at the time of explainable artificial intelligence, there is an increasing need for interpretable models. The present work advocates a novel one-class model which tackles this challenge. Within a greedy and recursive approach, our proposal for an explainable One-Class decision Tree (OC-Tree) rests on kernel density estimation to split a data subset on the basis of one or several intervals of interest. Thus, the OC-Tree encloses data within hyper-rectangles of interest which can be described by a set of rules. Against state-of-the-art methods such as Cluster Support Vector Data Description (ClusterSVDD), One-Class Support Vector Machine (OCSVM) and isolation Forest (iForest), the OC-Tree performs favorably on a range of benchmark datasets. Furthermore, we propose a real medical application for which the OC-Tree has demonstrated effectiveness, through the ability to tackle interpretable medical diagnosis aid based on unbalanced datasets. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 70 条
[1]   The ADHD-200 Consortium: a model to advance the translational potential of neuroimaging in clinical neuroscience [J].
Acuna, Carlos .
FRONTIERS IN SYSTEMS NEUROSCIENCE, 2012, 6
[2]   The variability of human, BOLD hemodynamic responses [J].
Aguirre, GK ;
Zarahn, E ;
D'Esposito, M .
NEUROIMAGE, 1998, 8 (04) :360-369
[3]  
[Anonymous], ADHD EP
[4]   Detection of abnormal behaviour for dementia sufferers using Convolutional Neural Networks [J].
Arifoglu, Damla ;
Bouchachia, Abdelhamid .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 94 :88-95
[5]   Active Learning for One-Class Classification [J].
Barnabe-Lortie, Vincent ;
Bellinger, Colin ;
Japkowicz, Nathalie .
2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, :390-395
[6]   The Neuro Bureau ADHD-200 Preprocessed repository [J].
Bellec, Pierre ;
Chu, Carlton ;
Chouinard-Decorte, Francois ;
Benhajali, Yassine ;
Margulies, Daniel S. ;
Craddock, R. Cameron .
NEUROIMAGE, 2017, 144 :275-286
[7]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[8]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[9]   Default-mode brain dysfunction in mental disorders: A systematic review [J].
Broyd, Samantha J. ;
Demanuele, Charmaine ;
Debener, Stefan ;
Helps, Suzannah K. ;
James, Christopher J. ;
Sonuga-Barke, Edmund J. S. .
NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2009, 33 (03) :279-296
[10]  
Chalapathy R., 2018, Anomaly detection using one-class neural networks