Using POMDPs for learning cost sensitive decision trees

被引:9
作者
Maliah, Shlomi [1 ]
Shani, Guy [2 ]
机构
[1] Microsoft, Redmond, WA USA
[2] Ben Gurion Univ Negev, Software & Informat Syst Engn, Beer Sheva, Israel
关键词
POMDP; MDP; Cost sensitive classification; Decision trees; KNOWLEDGE;
D O I
10.1016/j.artint.2020.103400
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In classification, an algorithm learns to classify a given instance based on a set of observed attribute values. In many real world cases testing the value of an attribute incurs a cost. Furthermore, there can also be a cost associated with the misclassification of an instance. Cost sensitive classification attempts to minimize the expected cost of classification, by deciding after each observed attribute value, which attribute to measure next. In this paper we suggest Partially Observable Markov Decision Processes (POMDPs) as a modeling tool for cost sensitive classification. POMDPs are typically solved through a policy over belief states. We show how a relatively small set of potentially important belief states can be identified, and define an MDP over these belief states. To identify these potentially important belief states, we construct standard decision trees over all attribute subsets, and the leaves of these trees become the state space of our tree-based MDP. At each phase we decide on the next attribute to measure, balancing the cost of the measurement and the classification accuracy. We compare our approach to a set of previous approaches, showing our approach to work better for a range of misclassification costs. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:26
相关论文
共 32 条
  • [1] [Anonymous], 2013, Artificial Intelligence and Statistics
  • [2] A comparison of decision tree ensemble creation techniques
    Banfield, Robert E.
    Hall, Lawrence O.
    Bowyer, Kevin W.
    Kegelmeyer, W. P.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (01) : 173 - 180
  • [3] Bellman R.E., 1957, DYNAMIC PROGRAMMING
  • [4] Bonet B., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P73
  • [5] Cassandra A., 1997, P C UNCERTAINTIES AR, P54
  • [6] Domingos Pedro, 1999, P 5 ACM SIGKDD INT C, P155, DOI [10.1145/312129.312220, 10.1145/ 312129.312220, DOI 10.1145/312129.312220]
  • [7] Elkan C., 2001, P 17 INT JOINT C ART, V2, P973, DOI 10.5555/1642194.1642224
  • [8] Esmeir S, 2008, ADV NEURAL INFORM PR, P425
  • [9] Fan W, 1999, MACHINE LEARNING, PROCEEDINGS, P97
  • [10] Hall Mark, 2009, SIGKDD EXPLORATIONS, V11, P10, DOI DOI 10.1145/1656274.1656278