Oblique and rotation double random forest

Cited by: 36
Authors
Ganaie, M. A. [1]
Tanveer, M. [1]
Suganthan, P. N. [2, 3]
Snasel, V. [4]
Affiliations
[1] Indian Inst Technol Indore, Dept Math, Simrol, Indore 453552, India
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[3] Qatar Univ, Coll Engn, KINDI Ctr Comp Res, Doha, Qatar
[4] VSB Tech Univ Ostrava, Dept Comp Sci, Ostrava, Czech Republic
Keywords
Double random forest; Oblique random forest; Ensemble learning; Bootstrap; Decision tree; Classification; FEATURE-SELECTION; DECISION TREES; SAMPLE-SIZE; ENSEMBLE; CLASSIFICATION; CLASSIFIERS; VARIANCE; BIAS; SOLVE; SET
DOI
10.1016/j.neunet.2022.06.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Random forest is an ensemble of decision trees based on the bagging and random subspace concepts. As suggested by Breiman, the strength of unstable learners and the diversity among them are the core strengths of ensemble models. In this paper, we propose two approaches: oblique and rotation double random forests. In the first approach, we propose a rotation-based double random forest, in which a transformation or rotation of the feature space is generated at each node. At each node a different random feature subspace is chosen for evaluation, hence the transformation at each node is different. Different transformations result in better diversity among the base learners and hence better generalization performance. With the double random forest as base learner, the data at each node is transformed via two different transformations, namely principal component analysis and linear discriminant analysis. In the second approach, we propose an oblique double random forest. Decision trees in random forest and double random forest are univariate, which results in axis-parallel splits that fail to capture the geometric structure of the data. Also, the standard random forest may not grow sufficiently large decision trees, resulting in suboptimal performance. To capture the geometric properties and to grow decision trees of sufficient depth, we propose the oblique double random forest. The oblique double random forest models are multivariate decision trees. At each non-leaf node, a multisurface proximal support vector machine generates the optimal plane for better generalization performance. Also, different regularization techniques (Tikhonov regularization, axis-parallel split regularization, null space regularization) are employed to tackle the small sample size problem in the decision trees of the oblique double random forest.
The proposed ensembles of decision trees produce larger trees than the standard ensembles of decision trees because bagging is used at each non-leaf node, which results in improved performance. The baseline models and the proposed oblique and rotation double random forest models are evaluated on 121 benchmark UCI datasets and real-world fisheries datasets. Both statistical analysis and the experimental results demonstrate the efficacy of the proposed oblique and rotation double random forest models compared to the baseline models on the benchmark datasets. © 2022 Elsevier Ltd. All rights reserved.
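The node-level rotation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `rotate_node`, its parameters, and the choice to fit PCA on a node-level bootstrap sample and then apply the rotation to all samples at the node are assumptions made for illustration only. Only the PCA variant is shown; the linear discriminant analysis variant is omitted.

```python
import numpy as np


def rotate_node(X, n_features, rng):
    """Hypothetical helper: PCA rotation of a random feature subspace at one node.

    X          : (n_samples, n_total_features) data reaching the node
    n_features : size of the random feature subspace (assumed parameter)
    rng        : numpy.random.Generator
    """
    n_samples, n_total = X.shape
    # Node-level bootstrap: sample rows with replacement (double random forest idea).
    rows = rng.integers(0, n_samples, size=n_samples)
    # Random subspace: pick a feature subset without replacement.
    cols = rng.choice(n_total, size=n_features, replace=False)
    X_boot = X[rows][:, cols]
    # PCA via eigendecomposition of the covariance of the bootstrap sample.
    centered = X_boot - X_boot.mean(axis=0)
    cov = centered.T @ centered / max(n_samples - 1, 1)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Order components by decreasing variance.
    rotation = eigvecs[:, np.argsort(eigvals)[::-1]]
    # Assumption: rotate every sample at the node, not only the bootstrap draw.
    return cols, rotation, X[:, cols] @ rotation


rng = np.random.default_rng(0)
X = rng.normal(size=(50, 6))
cols, rotation, X_rotated = rotate_node(X, n_features=3, rng=rng)
```

Because the feature subset and the bootstrap sample are redrawn at every node, each node obtains its own rotation matrix, which is the source of the extra diversity the abstract refers to. A split search would then run on `X_rotated` instead of the original features.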
Pages: 496-517
Number of pages: 22