Measuring data quality with weighted metrics

被引:17
|
作者
Vaziri, Reza [1 ]
Mohsenzadeh, Mehran [1 ]
Habibi, Jafar [2 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
[2] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
data quality; information quality; metrics; weighted metrics; methodology; METHODOLOGY;
D O I
10.1080/14783363.2017.1332954
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Data quality (DQ) has been defined as 'fitness for use'. In order to measure and improve DQ, various methodologies have been defined. A DQ methodology is a set of guidelines and techniques that define a rational process to measure and improve the quality of data. In order to make DQ measurement and improvement more organised, DQ dimensions have been defined. A dimension is a single aspect of DQ, such as accuracy, completeness, timeliness, and relevancy. In order to measure dimensions, special tools have been developed. These are called metrics. In most organisations, some data are more significant than others. In other words, some data carry more 'weight'. Hence, they must play a more important role in DQ measurement. Most metrics developed so far do not take into account data weights. In this paper, new metrics based on data weights are defined in order to make them more practical. The effectiveness of the new 'weighted metrics' is tested in a case study. The case study shows that the DQ measurements by weighted metrics more closely reflect the opinion of data users.
引用
收藏
页码:708 / 720
页数:13
相关论文
共 50 条
  • [31] Measuring the Effect of Fraud on Data-Quality Dimensions
    Brahimi, Samiha
    Elhussein, Mariam
    DATA, 2023, 8 (08)
  • [32] A Policy-based Approach for Measuring Data Quality
    Grueneberg, K.
    Calo, S.
    Dewan, P.
    Verma, D.
    O'Gorman, Tristan
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4025 - 4031
  • [33] Empirical Validation of Object Oriented Data Warehouse Design Quality Metrics
    Gupta, Jaya
    Gosain, Anjana
    Nagpal, Sushama
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 320 - +
  • [34] Discovering Data Quality Problems The Case of Repurposed Data
    Zhang, Ruojing
    Indulska, Marta
    Sadiq, Shazia
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2019, 61 (05) : 575 - 593
  • [35] A comprehensive data quality methodology for web and structured data
    Batini, Carlo
    Cabitza, Federico
    Cappiello, Cinzia
    Francalanci, Chiara
    International Journal of Innovative Computing and Applications, 2008, 1 (03) : 205 - 218
  • [36] A multidimensional analysis of data quality for credit risk management: New insights and challenges
    Moges, Helen-Tadesse
    Dejaeger, Karel
    Lemahieu, Wilfried
    Baesens, Bart
    INFORMATION & MANAGEMENT, 2013, 50 (01) : 43 - 58
  • [37] Are metrics measuring what they should? An evaluation of Image Captioning task metrics
    Gonzalez-Chavez, Othon
    Ruiz, Guillermo
    Moctezuma, Daniela
    Ramirez-delReal, Tania
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2024, 120
  • [38] Data Quality Assessment for On-line Monitoring and Measuring System of Power Quality Based on Big Data and Data Provenance Theory
    Tian Hongxun
    Wang Honggang
    Zhou Kun
    Shi Mingtai
    Li Haosong
    Xu Zhongping
    Kang Taifeng
    Li Jin
    Cai Yaqi
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 248 - 252
  • [39] Measuring Data Quality: A Review of the Literature between 2005 and 2013
    Stausberg, Juergen
    Nasseh, Daniel
    Nonnemacher, Michael
    DIGITAL HEALTHCARE EMPOWERING EUROPEANS, 2015, 210 : 712 - 716
  • [40] Understanding the differences across data quality classifications: a literature review and guidelines for future research
    Haug, Anders
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2021, 121 (12) : 2651 - 2671