Random forests

Cited by: 82540
Author
Breiman, L [1]
Affiliation
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
Keywords
classification; regression; ensemble
DOI
10.1023/A:1010933404324
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International Conference, ***, 148-156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
Pages: 5-32
Page count: 28
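The procedure summarized in the abstract (trees grown on independently drawn random samples, a random selection of features at each node, and internal error estimates) can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the Gini split criterion, the depth limit, the toy two-cluster data, and the use of out-of-bag samples for the internal error estimate are all assumptions made for illustration.

```python
import random
from collections import Counter

def gini(labels):
    # Gini impurity of a list of class labels (assumed split criterion).
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values()) if n else 0.0

def build_tree(X, y, n_features, rng, depth=0, max_depth=5):
    # Stop at a pure node or at the (assumed) depth limit and return a leaf label.
    if len(set(y)) == 1 or depth >= max_depth:
        return Counter(y).most_common(1)[0][0]
    # Random selection of candidate features at this node.
    feats = rng.sample(range(len(X[0])), n_features)
    best = None
    for f in feats:
        for t in sorted(set(row[f] for row in X)):
            left = [i for i, row in enumerate(X) if row[f] <= t]
            right = [i for i, row in enumerate(X) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right])) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:  # no usable split among the sampled features
        return Counter(y).most_common(1)[0][0]
    _, f, t, left, right = best
    return (f, t,
            build_tree([X[i] for i in left], [y[i] for i in left],
                       n_features, rng, depth + 1, max_depth),
            build_tree([X[i] for i in right], [y[i] for i in right],
                       n_features, rng, depth + 1, max_depth))

def predict_tree(node, row):
    # Internal nodes are (feature, threshold, left, right) tuples; leaves are labels.
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def fit_forest(X, y, n_trees=25, n_features=1, seed=0):
    rng = random.Random(seed)
    n = len(X)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        tree = build_tree([X[i] for i in idx], [y[i] for i in idx], n_features, rng)
        forest.append((tree, set(idx)))  # remember which rows were in the bag
    return forest

def predict(forest, row):
    # Majority vote over all trees.
    return Counter(predict_tree(t, row) for t, _ in forest).most_common(1)[0][0]

def oob_error(forest, X, y):
    # Each row is voted on only by trees whose bootstrap sample did not contain it.
    wrong = counted = 0
    for i, row in enumerate(X):
        votes = Counter(predict_tree(t, row) for t, bag in forest if i not in bag)
        if votes:
            counted += 1
            wrong += votes.most_common(1)[0][0] != y[i]
    return wrong / counted if counted else float("nan")

# Hypothetical toy data: two well-separated clusters in two dimensions.
data_rng = random.Random(1)
X = ([[data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)] for _ in range(20)]
     + [[5 + data_rng.uniform(-1, 1), 5 + data_rng.uniform(-1, 1)] for _ in range(20)])
y = [0] * 20 + [1] * 20

forest = fit_forest(X, y, n_trees=25, n_features=1, seed=0)
print(predict(forest, [-1.5, -1.5]), predict(forest, [7.0, 7.0]))
print(oob_error(forest, X, y))
```

Raising `n_features` makes individual trees stronger but also more alike, which is the strength and correlation trade-off the abstract refers to; the out-of-bag estimate gives a way to watch that effect without a separate test set.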
Related papers
50 in total
  • [1] Imprecise Extensions of Random Forests and Random Survival Forests
    Utkin, Lev V.
    Kovalev, Maxim S.
    Meldo, Anna A.
    Coolen, Frank P. A.
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITIES: THEORIES AND APPLICATIONS (ISIPTA 2019), 2019, 103 : 404 - 413
  • [2] Random Forests
    Leo Breiman
    Machine Learning, 2001, 45 : 5 - 32
  • [3] Random forests
    Pavlov, YL
    PROBABILISTIC METHODS IN DISCRETE MATHEMATICS, 1997 : 11 - 18
  • [4] Random Prism: An Alternative to Random Forests
    Stahl, Frederic
    Bramer, Max
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVIII: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XIX, 2011 : 5 - 18
  • [5] CONSISTENCY OF RANDOM FORESTS
    Scornet, Erwan
    Biau, Gerard
    Vert, Jean-Philippe
    ANNALS OF STATISTICS, 2015, 43 (04) : 1716 - 1741
  • [6] Unsupervised random forests
    Mantero, Alejandro
    Ishwaran, Hemant
    STATISTICAL ANALYSIS AND DATA MINING, 2021, 14 (02) : 144 - 167
  • [7] Joints in Random Forests
    Correia, Alvaro H. C.
    Peharz, Robert
    de Campos, Cassio
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] Extremal Random Forests
    Gnecco, Nicola
    Terefe, Edossa Merga
    Engelke, Sebastian
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024
  • [9] Random Forests with R
    Maindonald, John H.
    INTERNATIONAL STATISTICAL REVIEW, 2021, 89 (02) : 422 - 423
  • [10] Enriched random forests
    Amaratunga, Dhammika
    Cabrera, Javier
    Lee, Yung-Seop
    BIOINFORMATICS, 2008, 24 (18) : 2010 - 2014