Learning under Concept Drift: A Review

被引:890
作者
Lu, Jie [1 ]
Liu, Anjin [1 ]
Dong, Fan [1 ]
Gu, Feng [1 ]
Gama, Joao [2 ]
Zhang, Guangquan [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Decis Syst & E Serv Intelligence Lab, Sydney, NSW 2007, Australia
[2] Univ Porto, Fac Econ, Lab Artificial Intelligence & Decis Support, P-4099002 Porto, Portugal
基金
澳大利亚研究理事会;
关键词
Machine learning; Market research; Data analysis; Big Data; Mobile handsets; Data models; Cameras; Concept drift; change detection; adaptive learning; data streams; TIME ADAPTIVE CLASSIFIERS; TRACKING CONCEPT DRIFT; DATA STREAMS; DECISION TREES; ENSEMBLE; ONLINE; CLASSIFICATION; MACHINE; DIFFERENCE;
D O I
10.1109/TKDE.2018.2876857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Concept drift describes unforeseeable changes in the underlying distribution of streaming data over time. Concept drift research involves the development of methodologies and techniques for drift detection, understanding, and adaptation. Data analysis has revealed that machine learning in a concept drift environment will result in poor learning results if the drift is not addressed. To help researchers identify which research topics are significant and how to apply related techniques in data analysis tasks, it is necessary that a high quality, instructive review of current research developments and trends in the concept drift field is conducted. In addition, due to the rapid development of concept drift in recent years, the methodologies of learning under concept drift have become noticeably systematic, unveiling a framework which has not been mentioned in literature. This paper reviews over 130 high quality publications in concept drift related research areas, analyzes up-to-date developments in methodologies and techniques, and establishes a framework of learning under concept drift including three main components: concept drift detection, concept drift understanding, and concept drift adaptation. This paper lists and discusses 10 popular synthetic datasets and 14 publicly available benchmark datasets used for evaluating the performance of learning algorithms aiming at handling concept drift. Also, concept drift related research directions are covered and discussed. By providing state-of-the-art knowledge, this survey will directly support researchers in their understanding of research developments in the field of learning under concept drift.
引用
收藏
页码:2346 / 2363
页数:18
相关论文
共 144 条
  • [1] Alippi C., 2012, PROC INT JOINT C NEU, P1
  • [2] Just-in-time adaptive classifiers - Part I: Detecting nonstationary changes
    Alippi, Cesare
    Roveri, Manuel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (07): : 1145 - 1153
  • [3] Hierarchical Change-Detection Tests
    Alippi, Cesare
    Boracchi, Giacomo
    Roveri, Manuel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (02) : 246 - 258
  • [4] Just-In-Time Classifiers for Recurrent Concepts
    Alippi, Cesare
    Boracchi, Giacomo
    Roveri, Manuel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (04) : 620 - 634
  • [5] A just-in-time adaptive classification system based on the intersection of confidence intervals rule
    Alippi, Cesare
    Boracchi, Giacomo
    Roveri, Manuel
    [J]. NEURAL NETWORKS, 2011, 24 (08) : 791 - 800
  • [6] Just-in-Time Adaptive Classifiers-Part II: Designing the Classifier
    Alippi, Cesare
    Roveri, Manuel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (12): : 2053 - 2064
  • [7] Parallel Concept Drift Detection with Online Map-Reduce
    Andrzejak, Artur
    Gomes, Joao Bartolo
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 402 - 407
  • [8] [Anonymous], 2007, P SDM
  • [9] [Anonymous], MACH LEARN
  • [10] [Anonymous], P 32 INT C DAT ENG