Conditional functional dependencies for data cleaning

被引:0
|
作者
Bohannon, Philip
Fan, Wenfei
Geerts, Floris
Jia, Xibei
Kementsietsidis, Anastasios
机构
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a class of constraints, referred to as conditional functional dependencies (CFDs), and study their applications in data cleaning. In contrast to traditional functional dependencies (FDs) that were developed mainly for schema design, CFDs aim at capturing the consistency of data by incorporating bindings of semantically related values. For CFDs we provide an inference system analogous to Armstrong's axioms for FDs, as well as consistency analysis. Since CFDs allow data bindings, a large number of individual constraints may hold on a table, complicating detection of constraint violations. We develop techniques for detecting CFD violations in SQL as well as novel techniques for checking multiple constraints in a single query. We experimentally evaluate the performance of our CFD-based methods for inconsistency detection. This not only yields a constraint theory for CFDs but is also a step toward a practical constraint-based method for improving data quality.
引用
收藏
页码:721 / 730
页数:10
相关论文
共 50 条
  • [1] Data repair of density-based data cleaning approach using conditional functional dependencies
    Al-Janabi, Samir
    Janicki, Ryszard
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (03) : 429 - 446
  • [2] Pattern Functional Dependencies for Data Cleaning
    Qahtan, Abdulhakim
    Tang, Nan
    Ouzzani, Mourad
    Cao, Yang
    Stonebraker, Michael
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (05): : 684 - 697
  • [3] Contextual Data Cleaning with Ontology Functional Dependencies
    Zheng, Zheng
    Zheng, Longtao
    Alipourlangouri, Morteza
    Chiang, Fei
    Golab, Lukasz
    Szlichta, Jaroslaw
    Baskaran, Sridevi
    Journal of Data and Information Quality, 2022, 14 (03)
  • [4] Contextual Data Cleaning with Ontology Functional Dependencies
    Zheng, Zheng
    Zheng, Longtao
    Alipourlangouri, Morteza
    Chiang, Fei
    Golab, Lukasz
    Szlichta, Jaroslaw
    Baskaran, Sridevi
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2022, 14 (03):
  • [5] Conditional functional dependencies for capturing data inconsistencies
    Fan, Wenfei
    Geerts, Floris
    Jia, Xibei
    Kementsietsidis, Anastasios
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (02):
  • [6] A consistency cleaning method based on content-related conditional functional dependencies
    Du, Yue-Feng (dr.duyuefeng@gmail.com), 1683, Northeast University (37):
  • [7] Semandaq: A Data Quality System Based on Conditional Functional Dependencies
    Fan, Wenfei
    Geerts, Floris
    Jia, Xibei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1460 - 1463
  • [8] Sectional and Conditional Functional Dependencies
    Li, Mingda
    Wang, Hongzhi
    Li, Ye
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2014, 2014, 8491 : 793 - 803
  • [9] Discovering Conditional Functional Dependencies
    Fan, Wenfei
    Geerts, Floris
    Li, Jianzhong
    Xiong, Ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (05) : 683 - 698
  • [10] Discovering Conditional Functional Dependencies
    Fan, Wenfei
    Geerts, Floris
    Lakshmanan, Laks V. S.
    Xiong, Ming
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1231 - +