Automatic Quality Improvement of Data on the Evolution of 2D Regions

被引:3
作者
Costa, Rogerio Luis de C. [1 ]
Moreira, Jose [2 ]
机构
[1] Polytech Leiria, CIIC, P-2411901 Leiria, Portugal
[2] Univ Aveiro, DETI IEETA, P-3810193 Aveiro, Portugal
来源
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II | 2022年 / 13088卷
关键词
Spatio-temporal data; Quadtree; Time series; Data quality; Prophet;
D O I
10.1007/978-3-030-95408-6_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work deals with data cleaning and quality improvement when representing the evolution of 2D regions extracted from real-world observations. It presents a method that combines quadtrees and time series to identify inconsistencies and poor-quality data in a sequence of 2D regions. Our algorithm splits a 2D space recursively into buckets and creates time series using spatial functions on bucket-delimited subregions. Smaller buckets represent the subregions with a higher number of inconsistencies over time. Then, it uses time series outlier detection methods and consistency metrics to identify polygons that are poor-quality representations and remove them from the original sequence. The proposed method identifies errors and inaccuracies even if they occur in several consecutive observations. We evaluated our strategy using a dataset extracted from real-world videos and compared it with another method from the literature. The results prove the effectiveness of this proposal.
引用
收藏
页码:288 / 300
页数:13
相关论文
共 16 条
[1]  
Adam N., 2004, P 2004 ACM S APPL CO, P576
[2]  
Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
[3]   Dealing with temporal and spatial correlations to classify outliers in geophysical data streams [J].
Appice, Annalisa ;
Guccione, Pietro ;
Malerba, Donato ;
Ciampi, Anna .
INFORMATION SCIENCES, 2014, 285 :162-180
[4]  
Birant D, 2006, ITI 2006: PROCEEDINGS OF THE 28TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, P179, DOI 10.1109/ITI.2006.1708474
[5]  
Cheng Tao, 2006, Trans. GIS, V10, P253, DOI DOI 10.1111/J.1467-9671.2006.00256.X
[6]   Experience: Quality Assessment and Improvement on a Forest Fire Dataset [J].
Costa, Rogerio Luis C. ;
Miranda, Enrico ;
Dias, Paulo ;
Moreira, Jose .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2021, 13 (01)
[7]   EXPERIENCE: Glitches in Databases, How to Ensure Data Quality by Outlier Detection Techniques [J].
D'Urso, Ciro .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 7 (03)
[8]   Spatio-temporal outlier detection algorithms based on computing behavioral outlierness factor [J].
Duggimpudi, Maria Bala ;
Abbady, Shaaban ;
Chen, Jian ;
Raghavan, Vijay V. .
DATA & KNOWLEDGE ENGINEERING, 2019, 122 :1-24
[9]  
Facebook, 2017, PROPHET FORECASTING
[10]   Outlier Detection for Temporal Data: A Survey [J].
Gupta, Manish ;
Gao, Jing ;
Aggarwal, Charu C. ;
Han, Jiawei .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) :2250-2267