Online AutoML: an adaptive AutoML framework for online learning

Cited by: 6
Authors
Celik, Bilge [1]
Singh, Prabhant [1]
Vanschoren, Joaquin [1]
Affiliations
[1] Eindhoven Univ Technol, Dept Comp Sci & Math, Groene Loper 5, NL-5600MB Eindhoven, Netherlands
Keywords
Online AutoML; Automated online learning; Concept drift; Automated drift adaptation
DOI
10.1007/s10994-022-06262-0
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automated Machine Learning (AutoML) has been used successfully in settings where the learning task is assumed to be static. In many real-world scenarios, however, the data distribution will evolve over time, and it is yet to be shown whether AutoML techniques can effectively design online pipelines in dynamic environments. This study aims to automate pipeline design for online learning while continuously adapting to data drift. For this purpose, we design an adaptive Online Automated Machine Learning (OAML) system, searching the complete pipeline configuration space of online learners, including preprocessing algorithms and ensembling techniques. This system combines the inherent adaptation capabilities of online learners with fast automated pipeline (re)optimization. Focusing on optimization techniques that can adapt to evolving objectives, we evaluate asynchronous genetic programming and asynchronous successive halving to optimize these pipelines continually. We experiment on real and artificial data streams with varying types of concept drift to test the performance and adaptation capabilities of the proposed system. The results confirm the utility of OAML over popular online learning algorithms and underscore the benefits of continuous pipeline redesign in the presence of data drift.
Pages: 1897-1921
Page count: 25