Feature selection schema based on game theory and biology migration algorithm for regression problems

被引:5
作者
Javidi, Mohammad Masoud [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Sci, Kerman, Iran
关键词
Feature selection; Nash equilibrium; Multi-objective optimization; Biology migration algorithm; Game theory; PARTICLE SWARM OPTIMIZATION; ARTIFICIAL BEE COLONY; FEATURE-EXTRACTION; CLASSIFICATION; PREDICTION; MANAGEMENT; SYSTEM;
D O I
10.1007/s13042-020-01174-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world datasets nowadays are of regression type, while only a few dimensionality reduction methods have been developed for regression problems. On the other hand, most existing regression methods are based on the computation of the covariance matrix, rendering them inefficient in the reduction process. Therefore, a BMA-based multi-objective feature selection method, GBMA, is introduced by incorporating the Nash equilibrium approach. GBMA is intended to maximize model accuracy and minimize the number of features through a less complex procedure. The proposed method is composed of four steps. The first step involves defining three players, each of which is trying to improve its objective function (i.e., model error, number of features, and precision adjustment). The second step includes clustering features based on the correlation therebetween and detecting the most appropriate ordering of features to enhance cluster efficiency. The third step comprises extracting a new feature from each cluster based on various weighting methods (i.e., moderate, strict, and hybrid). Finally, the fourth step encompasses updating players based on stochastic search operators. The proposed GBMA strategy explores the search space and finds optimal solutions in an acceptable amount of time without examining every possible solution. The experimental results and statistical tests based on ten well-known datasets from the UCI repository proved the high performance of GBMA in selecting features for solving regression problems.
引用
收藏
页码:303 / 342
页数:40
相关论文
共 59 条
[1]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[2]  
Arauzo-Azofra A., 2004, Proceedings of the fifth international conference on Recent Advances in Soft Computing, P104
[3]   The mRMR variable selection method: a comparative study for functional data [J].
Berrendero, J. R. ;
Cuevas, A. ;
Torrecilla, J. L. .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (05) :891-907
[4]  
Bhagat S, 2011, SOCIAL NETWORK DATA ANALYTICS, P115
[5]  
Cheng F.Y., 1999, Comput. Mech. Struct. Eng., P1, DOI DOI 10.1016/B978-008043008-9/50039-9
[6]   Hierarchical co-evolutionary clustering tree-based rough feature game equilibrium selection and its application in neonatal cerebral cortex MRI [J].
Ding, Weiping ;
Lin, Chin-Teng ;
Prasad, Mukesh .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 101 :243-257
[7]  
Eberhart R., 1995, MHS 95 P 6 INT S MIC, P39, DOI DOI 10.1109/MHS.1995.494215
[8]  
Fukunaga K, 1990, INTRO STAT PATTERN R, P2
[9]   An improved feature selection algorithm based on graph clustering and ant colony optimization [J].
Ghimatgar, Hojat ;
Kazemi, Kamran ;
Helfroush, Mohamamd Sadegh ;
Aarabi, Ardalan .
KNOWLEDGE-BASED SYSTEMS, 2018, 159 :270-285
[10]   A survey on pre-processing techniques: Relevant issues in the context of environmental data mining [J].
Gibert, Karina ;
Sanchez-Marre, Miquel ;
Izquierdo, Joaquin .
AI COMMUNICATIONS, 2016, 29 (06) :627-663