An extensible metadata framework for data quality assessment of composite structures

被引:0
作者
Farinha, Jose [1 ]
Trigueiros, Maria Jose [1 ]
机构
[1] ISCTE ADETTI, Dept Sci & Informat Technol, Av Forcas Armadas, P-1649026 Lisbon, Portugal
来源
DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS | 2007年 / 4654卷
关键词
data quality; metadata; metamodel; CWM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data quality is a critical issue both in operational databases and in data warehouse systems. Data quality assessment is a strong requirement regarding the ETL subsystem, since bad data may destroy data warehouse credibility. During the last two decades, research and development efforts in the data quality field have produced techniques for data profiling and cleaning, which focus on detecting and correcting bad values in data. Little efforts have been done considering data quality when it relates to the well-formedness of coarse grained data structures resulting from the assembly of linked data records. This paper proposes a metadata model that supports the structural validation of linked data records, from a data quality point of view. The metamodel is built on top of the CWM standard and it supports the specification of data structure quality rules in a high level of abstraction, as well as by means of very specific fine grained business rules.
引用
收藏
页码:34 / +
页数:3
相关论文
共 50 条
  • [1] An Extensible Framework for Data Reliability Assessment
    Oliveira, Oscar
    Oliveira, Bruno
    ICEIS: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2022, : 77 - 84
  • [2] Metadata-based data quality assessment
    Aljumaili, Mustafa
    Karim, Ramin
    Tretten, Phillip
    VINE JOURNAL OF INFORMATION AND KNOWLEDGE MANAGEMENT SYSTEMS, 2016, 46 (02) : 232 - 250
  • [3] DESCRIBING DATA QUALITY PROBLEM THROUGH A METADATA FRAMEWORK
    Yeoh, William
    Wang, Te-Wei
    Verbitskiy, Yuriy
    AMCIS 2012 PROCEEDINGS, 2012,
  • [4] Extensible metadata framework for describing virtual reality and multimedia contents
    Walczak, Krzysztof
    Chmielewski, Jacek
    Stawniak, Miroslaw
    Strykowski, Sergiusz
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON DATABASES AND APPLICATIONS, 2006, : 168 - +
  • [6] LabPipe: an extensible bioinformatics toolkit to manage experimental data and metadata
    Bo Zhao
    Luke Bryant
    Rebecca Cordell
    Michael Wilde
    Dahlia Salman
    Dorota Ruszkiewicz
    Wadah Ibrahim
    Amisha Singapuri
    Tim Coats
    Erol Gaillard
    Caroline Beardsmore
    Toru Suzuki
    Leong Ng
    Neil Greening
    Paul Thomas
    Paul Monks
    Christopher Brightling
    Salman Siddiqui
    Robert C. Free
    BMC Bioinformatics, 21
  • [7] LabPipe: an extensible bioinformatics toolkit to manage experimental data and metadata
    Zhao, Bo
    Bryant, Luke
    Cordell, Rebecca
    Wilde, Michael
    Salman, Dahlia
    Ruszkiewicz, Dorota
    Ibrahim, Wadah
    Singapuri, Amisha
    Coats, Tim
    Gaillard, Erol
    Beardsmore, Caroline
    Suzuki, Toru
    Ng, Leong
    Greening, Neil
    Thomas, Paul
    Monks, Paul
    Brightling, Christopher
    Siddiqui, Salman
    Free, Robert C.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [8] A Metadata Framework for Data Lagoons
    Theodorou, Vasileios
    Hai, Rihan
    Quix, Christoph
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2019, 2019, 1064 : 452 - 462
  • [9] Automated Quality Assessment of Metadata across Open Data Portals
    Neumaier, Sebastian
    Umbrich, Jurgen
    Polleres, Axel
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 8 (01):
  • [10] Linking a Consortium-Wide Data Quality Assessment Tool with the MIRACUM Metadata Repository
    Kapsner, Lorenz A.
    Mang, Jonathan M.
    Mate, Sebastian
    Seuchter, Susanne A.
    Vengadeswaran, Abishaa
    Bathelt, Franziska
    Deppenwiese, Noemi
    Kadioglu, Dennis
    Kraska, Detlef
    Prokosch, Hans-Ulrich
    APPLIED CLINICAL INFORMATICS, 2021, 12 (04): : 826 - 835