A Data Quality Framework for Graph-Based Virtual Data Integration Systems

被引:0
|
作者
Li, Yalei [1 ]
Nadal, Sergi [1 ]
Romero, Oscar [1 ]
机构
[1] Univ Politecn Catalunya BarcelonaTech, Barcelona, Spain
来源
ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2022 | 2022年 / 13389卷
关键词
Data Quality; Data integration; Denial constraints; APPROXIMATE; DISCOVERY;
D O I
10.1007/978-3-031-15740-0_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Quality (DQ) plays a critical role in data integration. Up to now, DQ has mostly been addressed from a single database perspective. Popular DQ frameworks rely on Integrity Constraints (IC) to enforce valid application semantics, which lead to the Denial Constraint (DC) formalism which models a broad range of ICs in real-world applications. Yet, current approaches are rather monolithic, considering a single database and do not suit data integration scenarios. In this paper, we address DQ for data integration systems. Specifically, we extend virtual data integration systems to elicit DCs from disparate data sources to be integrated, using DC-related state-of-the-art, and propagate them to the integrated schema (global DCs). Then, we propose a method to manage global DCs and identify (i) minimal DCs and (ii) potential clashes between them.
引用
收藏
页码:104 / 117
页数:14
相关论文
共 50 条
  • [41] A GRAPH-BASED FRAMEWORK FOR RAPID CONSTRUCTION OF DOCUMENT INTEGRATION TOOLS
    Koertgen, Anne-Therese
    Becker, Simon M.
    Herold, Sebastian
    JOURNAL OF INTEGRATED DESIGN & PROCESS SCIENCE, 2007, 11 (04) : 19 - 39
  • [42] A graph-based framework for rapid construction of document integration tools
    Department of Computer Science 3, RWTH Aachen University, Germany
    J. Integr. Des. Process Sci., 2007, 4 (19-39):
  • [43] Graph-based spatial segmentation of areal data
    Goepp, Vivien
    van de Kassteele, Jan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192
  • [44] Graph-based estimators for paired comparison data
    Ghosh, Sayan
    Davidov, Ori
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 209 : 1 - 11
  • [45] A graph-based model for semistructured temporal data
    Combi, C
    Oliboni, B
    Quintarelli, E
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2003: OTM 2003 WORKSHOPS, 2003, 2889 : 22 - 23
  • [46] Graph-based Clustering for Time Series Data
    Li, Peiyu
    Boubrahimi, Soukaina Filali
    Hamdi, Shah Muhammad
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4464 - 4467
  • [47] Graph-Based Analysis of Nuclear Smuggling Data
    Cook, Diane
    Holder, Lawrence
    Thompson, Sandy
    Whitney, Paul
    Chilton, Lawrence
    JOURNAL OF APPLIED SECURITY RESEARCH, 2009, 4 (04) : 501 - 517
  • [48] Graph-based Data for Accessible Indoor Navigation
    Simon-Nagy, Gabriella
    Chalhoub, Nidal
    Fleiner, Rita
    2019 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES 2019), 2019, : 351 - 355
  • [49] Graph-based visualization of sensitive medical data
    Kalamaras, Ilias
    Glykos, Konstantinos
    Megalooikonomou, Vasilis
    Votis, Konstantinos
    Tzovaras, Dimitrios
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (01) : 209 - 236
  • [50] A GRAPH-BASED DATA MODEL AND ITS RAMIFICATIONS
    LEVENE, M
    LOIZOU, G
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, 7 (05) : 809 - 823