Cleaning timestamps with temporal constraints

被引:0
作者
Shaoxu Song
Ruihong Huang
Yue Cao
Jianmin Wang
机构
[1] Tsinghua University,
来源
The VLDB Journal | 2021年 / 30卷
关键词
Data cleaning; Timestamp repairing; Temporal constraints;
D O I
暂无
中图分类号
学科分类号
摘要
Timestamps are often found to be dirty in various scenarios, e.g., in distributed systems with clock synchronization problems or unreliable RFID readers. Without cleaning the imprecise timestamps, temporal-related applications such as provenance analysis or pattern queries are not reliable. To evaluate the correctness of timestamps, temporal constraints could be employed, which declare the distance restrictions between timestamps. Guided by such constraints on timestamps, in this paper, we study a novel problem of repairing inconsistent timestamps that do not conform to the required temporal constraints. Following the same line of data repairing, the timestamp repairing problem is to minimally modify the timestamps towards satisfaction of temporal constraints. This problem is practically challenging, given the huge space of possible timestamps. We tackle the problem by identifying a concise set of promising candidates, where an optimal repair solution can always be found. Repair algorithms with efficient pruning are then devised over the identified candidates. Approximate solutions are also presented including simple heuristic and linear programming (LP) relaxation. Experiments on real datasets demonstrate the superiority of our proposal compared to the state-of-the-art approaches.
引用
收藏
页码:425 / 446
页数:21
相关论文
共 21 条
  • [1] Bentley JL(1975)Multidimensional binary search trees used for associative searching Commun. ACM 18 509-517
  • [2] Chu X(2013)Discovering denial constraints PVLDB 6 1498-1509
  • [3] Ilyas IF(1991)Temporal constraint networks Artif. Intell. 49 61-95
  • [4] Papotti P(2018)Bus-OLAP: a data management model for non-on-time events query over bus journey data Data Sci. Eng. 3 52-67
  • [5] Dechter R(1998)Supporting valid-time indeterminacy ACM Trans. Database Syst. 23 1-57
  • [6] Meiri I(2016)Cleaning timestamps with temporal constraints PVLDB 9 708-719
  • [7] Pearl J(2010)Recognizing patterns in streams with imprecise timestamps PVLDB 3 244-255
  • [8] Duan L(undefined)undefined undefined undefined undefined-undefined
  • [9] Pang T(undefined)undefined undefined undefined undefined-undefined
  • [10] Nummenmaa J(undefined)undefined undefined undefined undefined-undefined