A Multi-Armed Bandit Approach to Cost-Sensitive Decision Tree Learning

被引:3
作者
Lomax, Susan [1 ]
Vadera, Sunil [1 ]
Saraee, Mohammed [1 ]
机构
[1] Univ Salford, Sch Comp Sci & Engn, Salford M5 4WT, Lancs, England
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012) | 2012年
关键词
component; Cost-sensitive learning; Multi-armed bandit problems; Decision tree learning; Game Theory; INDUCTION;
D O I
10.1109/ICDMW.2012.33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several authors have studied the problem of inducing decision trees that aim to minimize costs of misclassification and take account of costs of tests. The approaches adopted vary from modifying the information theoretic attribute selection measure used in greedy algorithms such as C4.5 to using methods such as bagging and boosting. This paper presents a new framework, based on game theory, which recognizes that there is a trade-off between the cost of using a test and the misclassification costs. Cost-sensitive learning is viewed as a Multi-Armed Bandit problem, leading to a novel cost-sensitive decision tree algorithm. The new algorithm is evaluated on five data sets and compared to six well known algorithms J48, EG2, MetaCost, AdaCostM1, ICET and ACT. The preliminary results are promising showing that the new multi-armed based algorithm can produce more cost-effective trees without compromising accuracy.
引用
收藏
页码:162 / 168
页数:7
相关论文
共 35 条
[1]  
[Anonymous], 2014, C4. 5: programs for machine learning
[2]  
[Anonymous], P 21 INT C MACH LEAR, DOI DOI 10.1145/1015330
[3]  
[Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
[4]  
[Anonymous], 2007, Game Theory: A Very Short Introduction
[5]  
[Anonymous], 2003, RC22666 IBM
[6]  
[Anonymous], 2009, WEKA DATA MINING SOF
[7]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[8]  
Auer P, 1995, AN S FDN CO, P322, DOI 10.1109/SFCS.1995.492488
[9]  
Auer P., 2002, J MACHINE LEARNING R, V3, P397, DOI DOI 10.4271/610369
[10]  
Cesa-Bianchi N., 2006, PREDICTION LEARNING