A novel Hybrid Genetic Local Search Algorithm for feature selection and weighting with an application in strategic decision making in innovation management

被引:35
作者
Eroglu, Duygu Yilmaz [1 ]
Kilic, Kemal [2 ]
机构
[1] Uludag Univ, Dept Ind Engn, Gorukle Campus, TR-16059 Bursa, Turkey
[2] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
关键词
Feature subset selection; Feature weighting; Hybrid genetic local search algorithm; Strategic decision support; Innovation management; Data mining; SYSTEM;
D O I
10.1016/j.ins.2017.04.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In some applications, one needs not only to determine the relevant features but also provide a preferential ordering among the set of relevant features by weights. This paper presents a novel Hybrid Genetic Local Search Algorithm (HGA) in combination with the k-nearest neighbor classifier for simultaneous feature subset selection and feature weighting, particularly for medium-sized data sets. The performance of the proposed algorithm is compared with the performance of alternative feature subset selection algorithms and classifiers through experimental analyses in the various benchmark data sets publicly available on the UCI database. The developed HGA is then applied to a data set gathered from 184 manufacturing firms in the context of innovation management. The data set consists of scores of manufacturing firms in terms of various factors that are known to influence the innovation performance of manufacturing firms and referred to as innovation determinants, and their innovation performances. HGA is used to determine the relative significance of the innovation determinants. Our results demonstrated that the developed HGA is capable of eliminating the irrelevant features and successfully assess feature weights. Moreover, our work is an example how data mining can play a role in the context of strategic management decision making. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:18 / 32
页数:15
相关论文
共 45 条
[1]   INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW ;
KIBLER, D ;
ALBERT, MK .
MACHINE LEARNING, 1991, 6 (01) :37-66
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   A hybrid genetic algorithm for mixed model assembly line balancing problem with parallel workstations and zoning constraints [J].
Akpinar, Sener ;
Bayhan, G. Mirac .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (03) :449-457
[4]  
[Anonymous], OPERATIONAL RES
[5]  
Bache K., 2013, UCI Machine Learning Repository
[6]   Speeding up incremental wrapper feature subset selection with Naive Bayes classifier [J].
Bermejo, Pablo ;
Gamez, Jose A. ;
Puerta, Jose M. .
KNOWLEDGE-BASED SYSTEMS, 2014, 55 :140-147
[7]   A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets [J].
Bermejo, Pablo ;
Gamez, Jose A. ;
Puerta, Jose M. .
PATTERN RECOGNITION LETTERS, 2011, 32 (05) :701-711
[8]   A review of microarray datasets and applied feature selection methods [J].
Bolon-Canedo, V. ;
Sanchez-Marono, N. ;
Alonso-Betanzos, A. ;
Benitez, J. M. ;
Herrera, F. .
INFORMATION SCIENCES, 2014, 282 :111-135
[9]   Different metaheuristic strategies to solve the feature selection problem [J].
Casado Yusta, Silvia .
PATTERN RECOGNITION LETTERS, 2009, 30 (05) :525-534
[10]   Integrating dominance properties with genetic algorithms for parallel machine scheduling problems with setup times [J].
Chang, Pei-Chann ;
Chen, Shih-Hsin .
APPLIED SOFT COMPUTING, 2011, 11 (01) :1263-1274