A Data Quality Framework for Graph-Based Virtual Data Integration Systems

被引:0
|
作者
Li, Yalei [1 ]
Nadal, Sergi [1 ]
Romero, Oscar [1 ]
机构
[1] Univ Politecn Catalunya BarcelonaTech, Barcelona, Spain
来源
ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2022 | 2022年 / 13389卷
关键词
Data Quality; Data integration; Denial constraints; APPROXIMATE; DISCOVERY;
D O I
10.1007/978-3-031-15740-0_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Quality (DQ) plays a critical role in data integration. Up to now, DQ has mostly been addressed from a single database perspective. Popular DQ frameworks rely on Integrity Constraints (IC) to enforce valid application semantics, which lead to the Denial Constraint (DC) formalism which models a broad range of ICs in real-world applications. Yet, current approaches are rather monolithic, considering a single database and do not suit data integration scenarios. In this paper, we address DQ for data integration systems. Specifically, we extend virtual data integration systems to elicit DCs from disparate data sources to be integrated, using DC-related state-of-the-art, and propagate them to the integrated schema (global DCs). Then, we propose a method to manage global DCs and identify (i) minimal DCs and (ii) potential clashes between them.
引用
收藏
页码:104 / 117
页数:14
相关论文
共 50 条
  • [21] Implementation of a framework for graph-based keyword search over relational data
    Cozza V.
    International Journal of Intelligent Information and Database Systems, 2023, 16 (01) : 62 - 88
  • [22] Robust classification of graph-based data
    Alaiz, Carlos M.
    Fanuel, Michael
    Suykens, Johan A. K.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (01) : 230 - 251
  • [23] Robust classification of graph-based data
    Carlos M. Alaíz
    Michaël Fanuel
    Johan A. K. Suykens
    Data Mining and Knowledge Discovery, 2019, 33 : 230 - 251
  • [24] Graph-based skeleton data compression
    Das, Pratyusha
    Ortega, Antonio
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [25] Graph-based data clustering with overlaps
    Fellows, Michael R.
    Guo, Jiong
    Komusiewicz, Christian
    Niedermeier, Rolf
    Uhlmann, Johannes
    DISCRETE OPTIMIZATION, 2011, 8 (01) : 2 - 17
  • [26] Graph-Based RDF Data Management
    Zou L.
    Özsu M.T.
    Data Science and Engineering, 2017, 2 (1) : 56 - 70
  • [27] Graph-based Transform for Data Decorrelation
    Hou, Junhui
    Liu, Hui
    Chau, Lap-Pui
    2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 177 - 180
  • [28] Graph-Based Data Clustering with Overlaps
    Fellows, Michael R.
    Guo, Jiong
    Komusiewicz, Christian
    Niedermeier, Rolf
    Uhlmann, Johannes
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2009, 5609 : 516 - +
  • [29] Knowledge graph-based data integration system for digital twins of built assets
    Ramonell, Carlos
    Chacon, Rolando
    Posada, Hector
    AUTOMATION IN CONSTRUCTION, 2023, 156
  • [30] Graph-based induction for general graph structured data
    Matsuda, T
    Horiuchi, T
    Motoda, H
    Washio, T
    Kumazawa, K
    Arai, N
    DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 340 - 342