A comparison of random forest based algorithms: random credal random forest versus oblique random forest

被引:71
作者
Mantas, Carlos J. [1 ]
Castellano, Javier G. [1 ]
Moral-Garcia, Serafin [1 ]
Abellan, Joaquin [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
关键词
Classification; Ensemble schemes; Random forest; Imprecise probabilities; Credal sets; RIDGE-REGRESSION; IMPRECISE PROBABILITIES; UNCERTAINTY MEASURES; DECISION TREES; CLASS NOISE; ENSEMBLE; CLASSIFICATION; CLASSIFIERS; MODEL;
D O I
10.1007/s00500-018-3628-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random forest (RF) is an ensemble learning method, and it is considered a reference due to its excellent performance. Several improvements in RF have been published. A kind of improvement for the RF algorithm is based on the use of multivariate decision trees with local optimization process (oblique RF). Another type of improvement is to provide additional diversity for the univariate decision trees by means of the use of imprecise probabilities (random credal random forest, RCRF). The aim of this work is to compare experimentally these improvements of the RF algorithm. It is shown that the improvement in RF with the use of additional diversity and imprecise probabilities achieves better results than the use of RF with multivariate decision trees.
引用
收藏
页码:10739 / 10754
页数:16
相关论文
共 41 条
[11]   An alternative model for the analysis of detecting electronic industries earnings management using stepwise regression, random forest, and decision tree [J].
Chen, Fu-Hsiang ;
Howard, Hu .
SOFT COMPUTING, 2016, 20 (05) :1945-1960
[12]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[13]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157
[14]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[15]   Using machine learning and big data approaches to predict travel time based on historical and real-time data from Taiwan electronic toll collection [J].
Fan, Shu-Kai S. ;
Su, Chuan-Jun ;
Nien, Han-Tang ;
Tsai, Pei-Fang ;
Cheng, Chen-Yang .
SOFT COMPUTING, 2018, 22 (17) :5707-5718
[16]  
Frank E., 2005, Data Mining: Practical Machine Learning Tools and Techniques
[17]   Classification in the Presence of Label Noise: a Survey [J].
Frenay, Benoit ;
Verleysen, Michel .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) :845-869
[19]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[20]   Ridge regression: Biased estimation for nonorthogonal problems [J].
Hoerl, AE ;
Kennard, RW .
TECHNOMETRICS, 2000, 42 (01) :80-86