Metaheuristics for data mining: survey and opportunities for big data

被引:13
作者
Dhaenens, Clarisse [1 ]
Jourdan, Laetitia [1 ]
机构
[1] Univ Lille, Cent Lille, CNRS, UMR 9189 CRIStAL, F-59000 Lille, France
关键词
Metaheuristics; Clustering; Association rules; Classification; Feature selection; Big data; PARTICLE SWARM OPTIMIZATION; FUZZY ASSOCIATION RULES; GENETIC ALGORITHM; EVOLUTIONARY ALGORITHMS; OPERATIONS-RESEARCH; FEATURE-SELECTION; DECISION TREES; EFFICIENT ALGORITHM; BAT ALGORITHM; SEARCH;
D O I
10.1007/s10479-021-04496-0
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In the context of big data, many scientific communities aim to provide efficient approaches to accommodate large-scale datasets. This is the case of the machine-learning community, and more generally, the artificial intelligence community. The aim of this article is to explain how data mining problems can be considered as combinatorial optimization problems, and how metaheuristics can be used to address them. Four primary data mining tasks are presented: clustering, association rules, classification, and feature selection. This article follows the publication of a book in 2016 concerning this subject (Dhaenens and Jourdan in Metaheuristics for big data, Wiley, Hoboken, 2016), and an article published in 4OR (Dhaenens and Jourdan in 4OR 17 (2):115-139, 2019); additionally, updated references and an analysis of the current trends are presented.
引用
收藏
页码:117 / 140
页数:24
相关论文
共 168 条
  • [91] Khan K., 2012, I.J. Intelligent Systems and Applications, V4, P23, DOI [10.5815/ijisa.2012.07.03, DOI 10.5815/IJISA.2012.07.03, 10.5815/ijisa10.5815/ijisa:2012.0710.5815/ijisa:2012.07.03]
  • [92] Kim Y, 2003, DATA MINING: OPPORTUNITIES AND CHALLENGES, P80
  • [93] KIRA K, 1992, MACHINE LEARNING /, P249
  • [94] Wrappers for feature subset selection
    Kohavi, R
    John, GH
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 273 - 324
  • [95] Genetic K-means algorithm
    Krishna, K
    Murty, MN
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1999, 29 (03): : 433 - 439
  • [96] Kurada R. R., 2013, ARXIV PREPRINT ARXIV
  • [97] Laney D., 2001, Meta Group Research Note, V6, P1
  • [98] Larose D. T., 2014, k-nearest neighbor algorithm
  • [99] A hybrid particle swarm optimization and its application in neural networks
    Leung, S. Y. S.
    Tang, Yang
    Wong, W. K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 395 - 405
  • [100] Liu H, 2008, CH CRC DATA MIN KNOW, P3