Forest Pruning Based on Branch Importance

被引:6
作者
Jiang, Xiangkui [1 ]
Wu, Chang-an [2 ]
Guo, Huaping [2 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Automat, Xian 710121, Shaanxi, Peoples R China
[2] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1155/2017/3162571
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A forest is an ensemble with decision trees as members. This paper proposes a novel strategy to pruning forest to enhance ensemble generalization ability and reduce ensemble size. Unlike conventional ensemble pruning approaches, the proposed method tries to evaluate the importance of branches of trees with respect to the whole ensemble using a novel proposed metric called importance gain. The importance of a branch is designed by considering ensemble accuracy and the diversity of ensemble members, and thus the metric reasonably evaluates how much improvement of the ensemble accuracy can be achieved when a branch is pruned. Our experiments show that the proposed method can significantly reduce ensemble size and improve ensemble accuracy, no matter whether ensembles are constructed by a certain algorithm such as bagging or obtained by an ensemble selection algorithm, no matter whether each decision tree is pruned or unpruned.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] [Anonymous], 2008, P 23 AAAI C ART INT
  • [2] Asuncion A., 2007, Uci machine learning repository
  • [3] Banfield R. E., 2005, Information Fusion, V6, P49, DOI 10.1016/j.inffus.2004.04.005
  • [4] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Introducing randomness into greedy ensemble pruning algorithms
    Dai, Qun
    Li, Meiling
    [J]. APPLIED INTELLIGENCE, 2015, 42 (03) : 406 - 429
  • [8] A new reverse reduce-error ensemble pruning algorithm
    Dai, Qun
    Zhang, Ting
    Liu, Ningzhong
    [J]. APPLIED SOFT COMPUTING, 2015, 28 : 237 - 249
  • [9] Demsar J, 2006, J MACH LEARN RES, V7, P1
  • [10] Ensemble methods in machine learning
    Dietterich, TG
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 : 1 - 15