Random feature selection using random subspace logistic regression

被引:16
作者
Wichitaksorn, Nuttanan [1 ]
Kang, Yingyue [1 ]
Zhang, Faqiang [1 ]
机构
[1] Auckland Univ Technol, Dept Math Sci, Private Bag 92006, Auckland 1142, New Zealand
关键词
Feature selection; Logistic regression; Lasso logistic; Random subspace; Bootstrap; CANCER CLASSIFICATION; GENE SELECTION; INFORMATION; ALGORITHMS; PREDICTION; MODEL; SVM; GIS;
D O I
10.1016/j.eswa.2023.119535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection becomes a prominent method in the big data era. The logistic regression model is a wrapper method that provides better classification or prediction accuracy but it is computationally expensive. In this study, we propose the random subspace logistic regression where features are randomly selected through bootstrap cycles. The random subspace regression method is applied to both standard and lasso logistic regression models. Using the simulated and empirical data, our proposed random subspace logistic regression shows favorable results and can be a promising alternative for flat feature selection.
引用
收藏
页数:8
相关论文
共 78 条
[1]   Non-parametric classifier-independent feature selection [J].
Abe, N ;
Kudo, M .
PATTERN RECOGNITION, 2006, 39 (05) :737-746
[2]  
[Anonymous], 2011, P 27 C UNC ART INT U
[3]   Empirical study of feature selection methods based on individual feature evaluation for classification problems [J].
Arauzo-Azofra, Antonio ;
Aznarte, Jose Luis ;
Benitez, Jose M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :8170-8177
[4]   The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan [J].
Ayalew, L ;
Yamagishi, H .
GEOMORPHOLOGY, 2005, 65 (1-2) :15-31
[5]   Effective and Efficient Feature Selection for Large-scale Data Using Bayes' Theorem [J].
Balamurugan, Subramanian Appavu Alias ;
Rajaram, Ramasamy .
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2009, 6 (01) :62-71
[6]  
Batra S. S., 2014, ARXIV
[7]   MODELLING SMALL-BUSINESS CREDIT SCORING BY USING LOGISTIC REGRESSION, NEURAL NETWORKS AND DECISION TREES [J].
Bensic, Mirta ;
Sarlija, Natasa ;
Zekic-Susac, Marijana .
INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2005, 13 (03) :133-150
[8]   Application of the logistic function to bio-assay [J].
Berkson, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1944, 39 (227) :357-365
[9]   Bio-molecular cancer prediction with random subspace ensembles of support vector machines [J].
Bertoni, A ;
Folgieri, R ;
Valentini, G .
NEUROCOMPUTING, 2005, 63 :535-539
[10]   Spatial prediction of landslides using a hybrid machine learning approach based on Random Subspace and Classification and Regression Trees [J].
Binh Thai Pham ;
Prakash, Indra ;
Dieu Tien Bui .
GEOMORPHOLOGY, 2018, 303 :256-270