Anomaly detection in predictive maintenance: A new evaluation framework for temporal unsupervised anomaly detection algorithms

被引:30
作者
Carrasco, Jacinto [1 ]
Lopez, David [1 ]
Aguilera-Martos, Ignacio [1 ]
Garcia-Gil, Diego [1 ]
Markova, Irina [2 ]
Garcia-Barzana, Marta [2 ]
Arias-Rodil, Manuel [2 ]
Luengo, Julian [1 ]
Herrera, Francisco [1 ]
机构
[1] Univ Granada, Inst Data Sci & Computat Intelligence, Dept Comp Sci & AI, Andalusian Res, Granada, Spain
[2] ArcelorMittal Global R&D, New Frontier, Digital Portfolio, Hamilton, ON, Canada
关键词
Anomaly; Outlier; Score system; Evaluation; Benchmark; EVENTS;
D O I
10.1016/j.neucom.2021.07.095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research in anomaly detection lacks a unified definition of what represents an anomalous instance. Discrepancies in the nature itself of an anomaly lead to multiple paradigms of algorithms design and experimentation. Predictive maintenance is a special case, where the anomaly represents a failure that must be prevented. Related time series research as outlier and novelty detection or time series classification does not apply to the concept of an anomaly in this field, because they are not single points which have not been seen previously and may not be precisely annotated. Moreover, due to the lack of annotated anomalous data, many benchmarks are adapted from supervised scenarios. To address these issues, we generalise the concept of positive and negative instances to intervals to be able to evaluate unsupervised anomaly detection algorithms. We also preserve the imbalance scheme for evaluation through the proposal of the Preceding Window ROC, a generalisation for the calculation of ROC curves for time series scenarios. We also adapt the mechanism from a established time series anomaly detection benchmark to the proposed generalisations to reward early detection. Therefore, the proposal represents a flexible evaluation framework for the different scenarios. To show the usefulness of this definition, we include a case study of Big Data algorithms with a real-world time series problem provided by the company ArcelorMittal, and compare the proposal with an evaluation method. (c) 2021 Published by Elsevier B.V.
引用
收藏
页码:440 / 452
页数:13
相关论文
共 30 条
[1]  
Aggarwal Charu C., 2017, Outlier Analysis
[2]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[3]  
[Anonymous], 2009, Precision at n, DOI DOI 10.1007/978-0-387-39940-9_484
[4]  
[Anonymous], 2018, Learning from Imbalanced Data Sets, DOI DOI 10.1007/978-3-319-98074-4
[5]   Recent trends in the use of statistical tests for comparing swarm and evolutionary computing algorithms: Practical guidelines and a critical review [J].
Carrasco, J. ;
Garcia, S. ;
Rueda, M. M. ;
Das, S. ;
Herrera, F. .
SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
[6]   Analyzing rare event, anomaly, novelty and outlier detection terms under the supervised classification framework [J].
Carreno, Ander ;
Inza, Inaki ;
Lozano, Jose A. .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (05) :3575-3594
[7]   One-Class Convex Hull-Based Algorithm for Classification in Distributed Environments [J].
Fernandez-Francos, Diego ;
Fontenla-Romero, Oscar ;
Alonso-Betanzos, Amparo .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (02) :386-396
[8]  
Goldstein M, 2012, KI 2012 POSTER DEMO, P59, DOI DOI 10.1007/978-3-642-21329-8_4
[9]   A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data [J].
Goldstein, Markus ;
Uchida, Seiichi .
PLOS ONE, 2016, 11 (04)
[10]  
Gunay B, 2019, IEEE INT CON AUTO SC, P222, DOI [10.1109/coase.2019.8843290, 10.1109/COASE.2019.8843290]