Data-driven multinomial random forest: a new random forest variant with strong consistency

被引:3
作者
Chen, Junhao [1 ]
Wang, Xueli [1 ]
Lei, Fei [2 ]
机构
[1] Beijing Univ Technol, Sch Math Stat & Mech, Beijing, Peoples R China
[2] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
关键词
Random forest; Strong consistency; Classification; Regression; Machine learning; CLASSIFICATION;
D O I
10.1186/s40537-023-00874-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we modify the proof methods of some previously weakly consistent variants of random forest into strongly consistent proof methods, and improve the data utilization of these variants in order to obtain better theoretical properties and experimental performance. In addition, we propose the Data-driven Multinomial Random Forest (DMRF) algorithm, which has the same complexity with BreimanRF (proposed by Breiman) while satisfying strong consistency with probability 1. It has better performance in classification and regression tasks than previous RF variants that only satisfy weak consistency, and in most cases even surpasses BreimanRF in classification tasks. To the best of our knowledge, DMRF is currently a low-complexity and high-performing variation of random forest that achieves strong consistency with probability 1.
引用
收藏
页数:32
相关论文
共 22 条
  • [1] Integration of multi-omics data for prediction of phenotypic traits using random forest
    Acharjee, Animesh
    Kloosterman, Bjorn
    Visser, Richard G. F.
    Maliepaard, Chris
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [2] Multinomial random forest
    Bai, Jiawang
    Li, Yiming
    Li, Jiawei
    Yang, Xue
    Jiang, Yong
    Xia, Shu-Tao
    [J]. PATTERN RECOGNITION, 2022, 122
  • [3] Neural Random Forests
    Biau, Gerard
    Scornet, Erwan
    Welbl, Johannes
    [J]. SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2019, 81 (02): : 347 - 386
  • [4] Biau G, 2012, J MACH LEARN RES, V13, P1063
  • [5] Biau G, 2008, J MACH LEARN RES, V9, P2015
  • [6] Bifet A, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P139
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Cootes TF, 2012, LECT NOTES COMPUT SC, V7578, P278, DOI 10.1007/978-3-642-33786-4_21
  • [9] Random forests for classification in ecology
    Cutler, D. Richard
    Edwards, Thomas C., Jr.
    Beard, Karen H.
    Cutler, Adele
    Hess, Kyle T.
    [J]. ECOLOGY, 2007, 88 (11) : 2783 - 2792
  • [10] Denil M, 2014, PR MACH LEARN RES, V32