Analyzing and repairing concept drift adaptation in data stream classification

被引:18
|
作者
Halstead, Ben [1 ]
Koh, Yun Sing [1 ]
Riddle, Patricia [1 ]
Pears, Russel [2 ]
Pechenizkiy, Mykola [3 ]
Bifet, Albert [4 ,5 ]
Olivares, Gustavo [6 ]
Coulson, Guy [6 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Auckland Univ Technol, Auckland, New Zealand
[3] Eindhoven Univ Technol, Eindhoven, Netherlands
[4] Univ Waikato, Hamilton, New Zealand
[5] IP Paris, Telecom Paris, LTCI, Paris, France
[6] Natl Inst Water & Atmospher Res, Auckland, New Zealand
关键词
Concept drift; Data stream classification; Recurring concepts; CLASSIFIERS; SELECTION;
D O I
10.1007/s10994-021-05993-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data collected over time often exhibit changes in distribution, or concept drift, caused by changes in factors relevant to the classification task, e.g. weather conditions. Incorporating all relevant factors into the model may be able to capture these changes, however, this is usually not practical. Data stream based methods, which instead explicitly detect concept drift, have been shown to retain performance under unknown changing conditions. These methods adapt to concept drift by training a model to classify each distinct data distribution. However, we hypothesize that existing methods do not robustly handle real-world tasks, leading to adaptation errors where context is misidentified. Adaptation errors may cause a system to use a model which does not fit the current data, reducing performance. We propose a novel repair algorithm to identify and correct errors in concept drift adaptation. Evaluation on synthetic data shows that our proposed AiRStream system has higher performance than baseline methods, while is also better at capturing the dynamics of the stream. Evaluation on an air quality inference task shows AiRStream provides increased real-world performance compared to eight baseline methods. A case study shows that AiRStream is able to build a robust model of environmental conditions over this task, allowing the adaptions made to concept drift to be analysed and related to changes in weather. We discovered a strong predictive link between the adaptions made by AiRStream and changes in meteorological conditions.
引用
收藏
页码:3489 / 3523
页数:35
相关论文
共 50 条
  • [41] Data stream mining: methods and challenges for handling concept drift
    Wares, Scott
    Isaacs, John
    Elyan, Eyad
    SN APPLIED SCIENCES, 2019, 1 (11):
  • [42] Novel Class Detection with Concept Drift in Data Stream - AhtNODE
    Gandhi, Jay
    Gandhi, Vaibhav
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2020, 11 (01) : 15 - 26
  • [43] TS-DM: A Time Segmentation-Based Data Stream Learning Method for Concept Drift Adaptation
    Wang, Kun
    Lu, Jie
    Liu, Anjin
    Zhang, Guangquan
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (10) : 6000 - 6011
  • [44] Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift
    Hidalgo, Juan I. G.
    Santos, Silas G. T. C.
    Barros, Roberto S. M.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
  • [45] Overview of Wind and Photovoltaic Data Stream Classification and Data Drift Issues
    Zhu, Xinchun
    Wu, Yang
    Zhao, Xu
    Yang, Yunchen
    Liu, Shuangquan
    Shi, Luyi
    Wu, Yelong
    ENERGIES, 2024, 17 (17)
  • [46] Concept Drift Detection in Data Stream Clustering and its Application on Weather Data
    Namitha, K.
    Kumar, Santhosh G.
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 67 - 85
  • [47] Enhanced Intrusion Detection with Data Stream Classification and Concept Drift Guided by the Incremental Learning Genetic Programming Combiner
    Shyaa, Methaq A.
    Zainol, Zurinahni
    Abdullah, Rosni
    Anbar, Mohammed
    Alzubaidi, Laith
    Santamaria, Jose
    SENSORS, 2023, 23 (07)
  • [48] Concept Drift-Based Intrusion Detection For Evolving Data Stream Classification In IDS: Approaches And Comparative Study
    Seth, Sugandh
    Chahal, Kuljit Kaur
    Singh, Gurvinder
    COMPUTER JOURNAL, 2024, 67 (07) : 2529 - 2547
  • [49] Rival Learner Algorithm with Drift Adaptation for Online Data Stream Regression
    Liao, Zhenwei
    Wang, Yongheng
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [50] Concept Drift Adaptation by Exploiting Drift Type
    Li, Jinpeng
    Yu, Hang
    Zhang, Zhenyu
    Luo, Xiangfeng
    Xie, Shaorong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)