AutoML: state of the art with a focus on anomaly detection, challenges, and research directions

被引:66
作者
Bahri, Maroua [1 ]
Salutari, Flavia [2 ]
Putina, Andrian [2 ]
Sozio, Mauro [2 ]
机构
[1] INRIA, MiMove, Paris, France
[2] IP Paris, Telecom Paris, LTCI, Palaiseau, France
关键词
Machine learning; AutoML; Anomaly detection; Unsupervised learning; Hyper-parameter tuning; ALGORITHM; RANKING; CONFIGURATION;
D O I
10.1007/s41060-022-00309-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The last decade has witnessed the explosion of machine learning research studies with the inception of several algorithms proposed and successfully adopted in different application domains. However, the performance of multiple machine learning algorithms is very sensitive to multiple ingredients (e.g., hyper-parameters tuning and data cleaning) where a significant human effort is required to achieve good results. Thus, building well-performing machine learning algorithms requires domain knowledge and highly specialized data scientists. Automated machine learning (autoML) aims to make easier and more accessible the use of machine learning algorithms for researchers with varying levels of expertise. Besides, research effort to date has mainly been devoted to autoML for supervised learning, and only a few research proposals have been provided for the unsupervised learning. In this paper, we present an overview of the autoML field with a particular emphasis on the automated methods and strategies that have been proposed for unsupervised anomaly detection.
引用
收藏
页码:113 / 126
页数:14
相关论文
共 90 条
[1]  
Aggarwal, 2013, OUTLIER ENSEMBLES PO
[2]  
Aggarwal Charu C., 2015, Data Mining: The Textbook, DOI [DOI 10.1007/978-3-319-14142-8, 10.1007/978-3-319-14142-8]
[3]   Better software analytics via "DUO": Data mining algorithms using/used-by optimizers [J].
Agrawal, Amritanshu ;
Menzies, Tim ;
Minku, Leandro L. ;
Wagner, Markus ;
Yu, Zhe .
EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (03) :2099-2136
[4]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[5]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV
[6]  
[Anonymous], 2014, ACM SIGKDD EXPLOR NE, DOI DOI 10.1145/2594473.2594476
[7]  
[Anonymous], 2007, ACM Trans. Knowl. Discov. Data, DOI DOI 10.1145/1297332.1297338
[8]  
Ansótegui C, 2009, LECT NOTES COMPUT SC, V5732, P142, DOI 10.1007/978-3-642-04244-7_14
[9]  
Bahri M., 2020, INT JOINT C ART INT
[10]   AutoML for Stream k-Nearest Neighbors Classification [J].
Bahri, Maroua ;
Veloso, Bruno ;
Bifet, Albert ;
Gama, Joao .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :597-602