Local Correlation Integral Approach for Anomaly Detection Using Functional Data

Cited by: 3
Authors
Donoso, Jorge R. Sosa [1]
Flores, Miguel [2]
Naya, Salvador [3]
Tarrio-Saavedra, Javier [3]
Affiliations
[1] Escuela Politec Nacl, Fac Sci, Dept Math, Quito 170517, Ecuador
[2] Escuela Politec Nacl, Fac Sci, Dept Math, MODES Grp, Quito 170517, Ecuador
[3] Univ A Coruna, Dept Math, MODES Grp, CITIC, Escola Politecn Enxenaria Ferrol, Ferrol 15403, Spain
Keywords
outlier detection; anomaly detection; FDA; LOCI; Hilbert space; OUTLIER DETECTION; CLASSIFICATION; PREDICTION; DEPTH; STATISTICS; CURVES; WOOD;
DOI
10.3390/math11040815
Chinese Library Classification (CLC) number
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
This work develops a methodology for detecting outliers in functional data that accounts for both the shape and the magnitude of the curves. Specifically, the multivariate anomaly detection method known as the Local Correlation Integral (LOCI) is extended to functional data by computing distances in Hilbert spaces. The methodology is validated through a simulation study and an application to real data. The simulation study covers scenarios in which the functional data (curves) exhibit different degrees of dependence, as is usual for data monitored continuously over time. Its results show that the functional LOCI approach performs well under inter-curve dependence, especially when the outliers are due to the magnitude of the curves. These findings are supported by applying the procedure to the humidity curves in the meteorological database of the Alternative Energy and Environment Group in Ecuador, where it performs better than other competing methods.
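The idea summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the curves are sampled on a common grid, that the Hilbert-space distance is the L2 distance (approximated with the trapezoidal rule), and that a single radius r is tested, whereas LOCI normally scans a range of radii. The function names l2_distances and loci_flags, the parameter names r, alpha and k_sigma, their values, and the toy data are all hypothetical choices made for this illustration.

```python
import numpy as np


def l2_distances(curves, t):
    """Pairwise L2 distances between curves sampled on a common grid t.

    Assumption: the integral of the squared difference is approximated
    with the trapezoidal rule.
    """
    dt = np.diff(t)
    n = curves.shape[0]
    dist = np.zeros((n, n))
    for i in range(n):
        sq = (curves - curves[i]) ** 2
        integral = np.sum((sq[:, 1:] + sq[:, :-1]) * dt / 2.0, axis=1)
        dist[i] = np.sqrt(integral)
    return dist


def loci_flags(dist, r, alpha=0.5, k_sigma=3.0):
    """Single-radius LOCI test on a precomputed distance matrix.

    For each curve i, MDEF = 1 - n(i, alpha*r) / n_hat(i, r, alpha), where
    n(i, alpha*r) counts curves within alpha*r of curve i (itself included)
    and n_hat averages those counts over the r-neighbourhood of curve i.
    A curve is flagged when MDEF > k_sigma * sigma_MDEF.
    """
    counts = (dist <= alpha * r).sum(axis=1)
    n = dist.shape[0]
    mdef = np.zeros(n)
    flags = np.zeros(n, dtype=bool)
    for i in range(n):
        nbr_counts = counts[dist[i] <= r]  # always contains curve i itself
        n_hat = nbr_counts.mean()
        sigma_mdef = nbr_counts.std() / n_hat
        mdef[i] = 1.0 - counts[i] / n_hat
        flags[i] = mdef[i] > k_sigma * sigma_mdef
    return mdef, flags


# Toy data (hypothetical): 20 similar curves plus one magnitude outlier.
t = np.linspace(0.0, 1.0, 101)
offsets = np.linspace(-0.1, 0.1, 20)
curves = np.sin(2 * np.pi * t) + offsets[:, None]
curves = np.vstack([curves, np.sin(2 * np.pi * t) + 5.0])

dist = l2_distances(curves, t)
mdef, flags = loci_flags(dist, r=6.0)
print(np.flatnonzero(flags))  # indices of the curves flagged as outliers
```

Because the test only needs a distance matrix, swapping the L2 distance for another functional metric (for example, one computed on derivatives to emphasize shape) would leave loci_flags unchanged.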
Pages: 18