An extensible metadata framework for data quality assessment of composite structures

Cited by: 0
Authors
Farinha, Jose [1]
Trigueiros, Maria Jose [1]
Affiliations
[1] ISCTE ADETTI, Dept Sci & Informat Technol, Av Forcas Armadas, P-1649026 Lisbon, Portugal
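Source
DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS | 2007 / Vol. 4654
Keywords
data quality; metadata; metamodel; CWM
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract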
Data quality is a critical issue both in operational databases and in data warehouse systems. Data quality assessment is a strong requirement for the ETL subsystem, since bad data may destroy data warehouse credibility. During the last two decades, research and development efforts in the data quality field have produced techniques for data profiling and cleaning, which focus on detecting and correcting bad values in data. Little effort has been devoted to data quality as it relates to the well-formedness of coarse-grained data structures that result from assembling linked data records. This paper proposes a metadata model that supports the structural validation of linked data records from a data quality point of view. The metamodel is built on top of the CWM standard and supports the specification of data structure quality rules at a high level of abstraction, as well as by means of very specific, fine-grained business rules.
Pages: 34 / +
Page count: 3