An instance-weighting method to induce cost-sensitive trees

Cited by: 0
Authors
Ting, KM [1]
Affiliations
[1] Monash Univ, Gippsland Sch Comp & Informat Technol, Churchill, Vic 3842, Australia
Keywords
cost-sensitive; decision trees; induction; greedy divide-and-conquer algorithm; instance weighting
DOI
10.1109/tkde.2002.1000348
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We introduce an instance-weighting method to induce cost-sensitive trees. It is a generalization of the standard tree induction process where only the initial instance weights determine the type of tree to be induced: minimum error trees or minimum high-cost error trees. We demonstrate that it can be easily adapted to an existing tree learning algorithm. Previous research provides insufficient evidence to support the idea that the greedy divide-and-conquer algorithm can effectively induce a truly cost-sensitive tree directly from the training data. We provide this empirical evidence in this paper. The algorithm incorporating the instance-weighting method is found to be better than the original algorithm in terms of total misclassification costs, the number of high-cost errors, and tree size in two-class data sets. The instance-weighting method is simpler and more effective in implementation than a previous method based on altered priors.
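The idea that only the initial instance weights decide which kind of tree is induced can be illustrated with a small sketch. The abstract does not give the weighting formula, so the code below assumes one plausible choice: each instance's weight is proportional to the misclassification cost of its class, scaled so that the weights sum to the number of instances. The function names (`initial_weights`, `weighted_entropy`, `leaf_label`) and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def initial_weights(labels, cost):
    """Assumed scheme: weight = class cost, rescaled so weights sum to len(labels)."""
    counts = Counter(labels)
    n = len(labels)
    total = sum(cost[c] * counts[c] for c in counts)
    return [cost[y] * n / total for y in labels]


def weighted_class_totals(labels, weights):
    """Sum of instance weights per class (replaces plain class counts)."""
    totals = defaultdict(float)
    for y, w in zip(labels, weights):
        totals[y] += w
    return dict(totals)


def weighted_entropy(labels, weights):
    """Entropy computed from weight totals, as a split criterion would use it."""
    totals = weighted_class_totals(labels, weights)
    s = sum(totals.values())
    return -sum((t / s) * math.log2(t / s) for t in totals.values() if t > 0)


def leaf_label(labels, weights):
    """A leaf predicts the class with the largest total weight."""
    totals = weighted_class_totals(labels, weights)
    return max(totals, key=totals.get)


# Toy two-class node: 8 low-cost "neg" instances, 2 high-cost "pos" instances.
labels = ["neg"] * 8 + ["pos"] * 2

# Equal costs give unit weights, i.e. the standard minimum-error setting.
unit = initial_weights(labels, {"neg": 1.0, "pos": 1.0})

# Making "pos" errors five times as costly shifts weight toward "pos".
costly = initial_weights(labels, {"neg": 1.0, "pos": 5.0})

print(leaf_label(labels, unit), leaf_label(labels, costly))
```

With equal costs every weight is 1 and the computation reduces to ordinary counting. With the unequal costs the induction procedure itself is unchanged; only the weights passed in differ, which is the property the abstract emphasizes.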
Pages: 659-665
Number of pages: 7