An extensible metadata framework for data quality assessment of composite structures

被引:0
作者
Farinha, Jose [1 ]
Trigueiros, Maria Jose [1 ]
机构
[1] ISCTE ADETTI, Dept Sci & Informat Technol, Av Forcas Armadas, P-1649026 Lisbon, Portugal
来源
DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS | 2007年 / 4654卷
关键词
data quality; metadata; metamodel; CWM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data quality is a critical issue both in operational databases and in data warehouse systems. Data quality assessment is a strong requirement regarding the ETL subsystem, since bad data may destroy data warehouse credibility. During the last two decades, research and development efforts in the data quality field have produced techniques for data profiling and cleaning, which focus on detecting and correcting bad values in data. Little efforts have been done considering data quality when it relates to the well-formedness of coarse grained data structures resulting from the assembly of linked data records. This paper proposes a metadata model that supports the structural validation of linked data records, from a data quality point of view. The metamodel is built on top of the CWM standard and it supports the specification of data structure quality rules in a high level of abstraction, as well as by means of very specific fine grained business rules.
引用
收藏
页码:34 / +
页数:3
相关论文
共 50 条
  • [21] A Framework for Quality Assessment of Semantic Annotations of Tabular Data
    Avogadro, Roberto
    Cremaschi, Marco
    Jimenez-Ruiz, Ernesto
    Rula, Anisa
    SEMANTIC WEB - ISWC 2021, 2021, 12922 : 528 - 545
  • [22] Metadata Quality Assessment Metrics into OCW Repositories
    Romero Pelaez, Ndaudrey
    Alarcon, Ndpedro P.
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON EDUCATION TECHNOLOGY AND COMPUTERS (ICETC 2017), 2017, : 253 - 257
  • [23] SkySat Data Quality Assessment within the EDAP Framework
    Saunier, Sebastien
    Karakas, Gizem
    Yalcin, Ilyas
    Done, Fay
    Mannan, Rubinder
    Albinet, Clement
    Goryl, Philippe
    Kocaman, Sultan
    REMOTE SENSING, 2022, 14 (07)
  • [24] Geographic metadata: Data quality, uncertainty and imprecision
    Akli-Astouati K.
    Guebaili R.
    Mokhtari A.
    International Journal of Reasoning-based Intelligent Systems, 2011, 3 (3-4) : 164 - 172
  • [25] Cleaning by clustering: methodology for addressing data quality issues in biomedical metadata
    Wei Hu
    Amrapali Zaveri
    Honglei Qiu
    Michel Dumontier
    BMC Bioinformatics, 18
  • [26] Cleaning by clustering: methodology for addressing data quality issues in biomedical metadata
    Hu, Wei
    Zaveri, Amrapali
    Qiu, Honglei
    Dumontier, Michel
    BMC BIOINFORMATICS, 2017, 18
  • [27] Towards a Data Quality Framework for Heterogeneous Data
    Micic, Natasha
    Neagu, Daniel
    Campean, Felician
    Zadeh, Esmaeil Habib
    2017 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2017, : 155 - 162
  • [28] Metadata and data structures for the historical newspaper digital library
    Allen, RB
    Schalow, J
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 147 - 153
  • [29] A Zero Trust Model Based Framework For Data Quality Assessment
    Mohammed, Mahmood
    Talburt, John R.
    Dagtas, Serhan
    Hollingsworth, Melissa
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 305 - 307
  • [30] Medical data quality assessment: On the development of an automated framework for medical data curation
    Pezoulas, Vasileios C.
    Kourou, Konstantina D.
    Kalatzis, Fanis
    Exarchos, Themis P.
    Venetsanopoulou, Aliki
    Zampeli, Evi
    Gandolfo, Saviana
    Skopouli, Fotini
    De Vita, Salvatore
    Tzioufas, Athanasios G.
    Fotiadis, Dimitrios I.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 107 : 270 - 283