Using genetic algorithms to optimize nearest neighbors for data mining

被引:15
作者
Ahn, Hyunchul [2 ]
Kim, Kyoung-jae [1 ]
机构
[1] Dongguk Univ, Dept Management Informat Syst, Seoul 100715, South Korea
[2] Sungshin Womens Univ, Dept Business Adm, Coll Social Sci, Seoul 136742, South Korea
关键词
case-based reasoning; genetic algorithms; number of neighbors to combine; stock market prediction; purchase prediction;
D O I
10.1007/s10479-008-0325-2
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Case-based reasoning (CBR) is widely used in data mining for managerial applications because it often shows significant promise for improving the effectiveness of complex and unstructured decision making. There are, however, some limitations in designing appropriate case indexing and retrieval mechanisms including feature selection and feature weighting. Some of the prior studies pointed out that finding the optimal k parameter for the k-nearest neighbor (k-NN) is also one of the most important factors for designing an effective CBR system. Nonetheless, there have been few attempts to optimize the number of neighbors, especially using artificial intelligence (AI) techniques. This study proposes a genetic algorithm (GA) approach to optimize the number of neighbors to combine. In this study, we apply this novel model to two real-world cases involving stock market and online purchase prediction problems. Experimental results show that a GA-optimized k-NN approach may outperform traditional k-NN. In addition, these results also show that our proposed method is as good as or sometime better than other AI techniques in performance-comparison.
引用
收藏
页码:5 / 18
页数:14
相关论文
共 23 条
[1]   A case-based reasoning system with the two-dimensional reduction technique for customer classification [J].
Ahn, Hyunchul ;
Kim, Kyoung-jae ;
Han, Ingoo .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (04) :1011-1019
[2]   Global optimization of feature weights and the number of neighbors that combine in a case-based reasoning system [J].
Ahn, Hyunchul ;
Kim, Kyoung-jae ;
Han, Ingoo .
EXPERT SYSTEMS, 2006, 23 (05) :290-301
[3]   Hybrid genetic algorithms and case-based reasoning systems for customer classification [J].
Ahn, Hyunchul ;
Kim, Kyoung-Jae ;
Han, Ingoo .
EXPERT SYSTEMS, 2006, 23 (03) :127-144
[4]   CASE-BASED REASONING - BUSINESS APPLICATIONS [J].
ALLEN, BP .
COMMUNICATIONS OF THE ACM, 1994, 37 (03) :40-42
[5]  
[Anonymous], 1991, STAT METHODS BUSINES
[6]  
[Anonymous], 1999, KOREAN J MANAG RES
[7]   A case-based expert support system for due-date assignment in a wafer fabrication factory [J].
Chiu, CC ;
Chang, PC ;
Chiu, NH .
JOURNAL OF INTELLIGENT MANUFACTURING, 2003, 14 (3-4) :287-296
[8]   A case-based customer classification approach for direct marketing [J].
Chiu, CC .
EXPERT SYSTEMS WITH APPLICATIONS, 2002, 22 (02) :163-168
[9]   GA based CBR approach in Q&A system [J].
Fu, YG ;
Shen, RM .
EXPERT SYSTEMS WITH APPLICATIONS, 2004, 26 (02) :167-170
[10]   Automatic diagnosis with genetic algorithms and case-based reasoning [J].
Guiu, JMGI ;
Ribé, EGI ;
Mansilla, EBI ;
Fàbrega, XLI .
ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1999, 13 (04) :367-372