Performance enrichment through parameter tuning of random forest classification for imbalanced data applications

被引:4
作者
More, Anjali S. [1 ]
Rana, Dipti P. [1 ]
机构
[1] SV Natl Inst Technol, Comp Engn Dept, Surat, India
关键词
Random forest classification; Feature selection; Cross-validation; Regression; Machine learning; Imbalance datasets;
D O I
10.1016/j.matpr.2021.12.020
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
One of the foremost application domains in today's real-life scenario is unequal data distribution within datasets, related classifiers, and its Performance Enrichment Techniques (PET). Random Forest Classification (RFC) is one of the most efficient techniques that can function speedily over binary or multiclass imbalanced characteristics datasets. With its built-in ensemble capacity, building a generalized model on any Binary Imbalanced Datasets (BID) and Multiclass Imbalanced Datasets (MID) gets much easier. Related work carried out here implies that the attention of researchers is inclined towards ID applications and related machine learning techniques. RFC gives improvised performance due to Ensemble Approach (EA). EA generates several classifiers and segregates the results as PET. The performance of the single classifier is lower than the performance of the set of multiple classifiers. The associated study in this research paper provides insight into three PET, which are the count of folds in cross validation, variable count of trees and the feature selection method through Regression Analysis (RA) and it's tuning with RFC Classifier. Time and Cost required for classification is saved with the help of RA. Extensive experimental analysis carried out with a predictive modeler Salford System with proposed PET and tested on 32 benchmark datasets, which justifies the enhanced performance of RFC.Copyright (c) 2022 Elsevier Ltd. All rights reserved.Selection and peer-review under responsibility of the scientific committee of the First International Conference on Design and Materials (ICDM)-2021
引用
收藏
页码:3585 / 3593
页数:9
相关论文
共 48 条
[1]  
[Anonymous], 2010, J COMPUT AIDED CIVIL, V25, P387
[2]  
[Anonymous], 2010, ELGAWIRANDOM FOREST, P152
[3]  
[Anonymous], 2017, INTELL AUTOM SOFT CO, DOI DOI 10.1080/10798587.2017.1321228.544
[4]  
AristidisLikas D., 2014, KONSTANTINOS BLEKASA, P399
[5]  
Avetisyan Hayk, 2013, FINANCIAL STABILITY, P57
[6]  
Babu Anantha, 2017, INT J SCI RES SCI TE, V3, P201
[7]  
Bhagat RC, 2015, IEEE INT ADV COMPUT, P403, DOI 10.1109/IADCC.2015.7154739
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Cao Wangjing, 2017, ACM, P3456
[10]  
Czachrski Tadeusz, 2007, INT J-TORONTO, P1