A procedure for anomaly detection and analysis

被引:15
|
作者
Koren, Oded [1 ]
Koren, Michal [1 ]
Peretz, Or [1 ]
机构
[1] Shenkar Engn Design Art, Sch Ind Engn & Management, Anne Frank 12, Ramat Gan, Israel
关键词
Anomaly detection; AutoML; Isolation forest; Local outlier factor; SVM; INTRUSION DETECTION; OUTLIER DETECTION;
D O I
10.1016/j.engappai.2022.105503
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection is often used to identify and remove outliers in datasets. However, detecting and analyzing the pattern of outliers can contribute to future business decisions or increase the accuracy of a learning algorithm. Selecting the applicable outlier detection method for a dataset requires human intervention and analysis due to the challenge of choosing an efficient technique suitable for all types of attributes. This work presents a procedure for anomaly detection and analysis. The procedure is feature-wise (i.e., processes each feature independently), uses T different anomaly detection techniques (for T > 1), and estimates the best technique using predefined thresholds. It is a generic method that does not depend on the model type and can be applied to supervised and unsupervised learning. In addition, this method does not impute or remove the outliers, as they should be adapted according to the dataset or business requirements. The significant advantage of this method is the ability to use different techniques to detect anomalies since it is applied per feature and not per record, as in traditional anomaly detection methods. Furthermore, the method uses a new measure, Noise Ratio (NR), which describes the level of agreement between our method's result and traditional anomaly detection techniques. The results showed that all the compared techniques identified non-anomalous features with consistent results between the various algorithms. In the proposed method, NR found up to 20% of the non-anomalous values marked as outliers and improved up to 10% in finding outliers in datasets compared to traditional anomaly detection algorithms.
引用
收藏
页数:8
相关论文
empty
未找到相关数据