Measuring data quality with weighted metrics

被引:17
|
作者
Vaziri, Reza [1 ]
Mohsenzadeh, Mehran [1 ]
Habibi, Jafar [2 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
[2] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
data quality; information quality; metrics; weighted metrics; methodology; METHODOLOGY;
D O I
10.1080/14783363.2017.1332954
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Data quality (DQ) has been defined as 'fitness for use'. In order to measure and improve DQ, various methodologies have been defined. A DQ methodology is a set of guidelines and techniques that define a rational process to measure and improve the quality of data. In order to make DQ measurement and improvement more organised, DQ dimensions have been defined. A dimension is a single aspect of DQ, such as accuracy, completeness, timeliness, and relevancy. In order to measure dimensions, special tools have been developed. These are called metrics. In most organisations, some data are more significant than others. In other words, some data carry more 'weight'. Hence, they must play a more important role in DQ measurement. Most metrics developed so far do not take into account data weights. In this paper, new metrics based on data weights are defined in order to make them more practical. The effectiveness of the new 'weighted metrics' is tested in a case study. The case study shows that the DQ measurements by weighted metrics more closely reflect the opinion of data users.
引用
收藏
页码:708 / 720
页数:13
相关论文
共 50 条
  • [1] An Advanced Big Data Quality Framework Based on Weighted Metrics
    Elouataoui, Widad
    El Alaoui, Imane
    El Mendili, Saida
    Gahi, Youssef
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [2] Metrics for measuring data quality - Foundations for an economic data quality management
    Heinrich, Bernd
    Kaiser, Marcus
    Klier, Mathias
    ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL ISDM/WSEHST/DC, 2007, : 87 - 94
  • [3] Metrics for measuring the quality of fused images
    Maruthi, R.
    Suresh, R. M.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL III, PROCEEDINGS, 2007, : 153 - +
  • [4] Requirements for Data Quality Metrics
    Heinrich, Bernd
    Hristova, Diana
    Klier, Mathias
    Schiller, Alexander
    Szubartowicz, Michael
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2018, 9 (02):
  • [5] Review of data quality indicators and metrics, and suggestions for indicators and metrics for structural health monitoring
    Makhoul, Nisrine
    ADVANCES IN BRIDGE ENGINEERING, 2022, 3 (01):
  • [6] A metrics framework for measuring quality of a web service as it evolves
    Kohar R.
    Parimala N.
    International Journal of System Assurance Engineering and Management, 2017, 8 (Suppl 2) : 1222 - 1236
  • [7] A study of the metrics for measuring the quality of the requirements specification document
    Wong, B
    SERP'04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH AND PRACTICE, VOLS 1 AND 2, 2004, : 549 - 553
  • [8] MEASURING DATA QUALITY IN VORTALS
    Caro, Angelica
    Angeles Moraga, Ma
    Moraga, Carmen
    Calero, Coral
    ICSOFT 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 2, 2009, : 217 - +
  • [9] Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam principles)
    Kinsinger, Christopher R.
    Apffel, James
    Baker, Mark
    Bian, Xiaopeng
    Borchers, Christoph H.
    Bradshaw, Ralph
    Brusniak, Mi-Youn
    Chan, Daniel W.
    Deutsch, Eric W.
    Domon, Bruno
    Gorman, Jeff
    Grimm, Rudolf
    Hancock, William
    Hermjakob, Henning
    Horn, David
    Hunter, Christie
    Kolar, Patrik
    Kraus, Hans-Joachim
    Langen, Hanno
    Linding, Rune
    Moritz, Robert L.
    Omenn, Gilbert S.
    Orlando, Ron
    Pandey, Akhilesh
    Ping, Peipei
    Rahbar, Amir
    Rivers, Robert
    Seymour, Sean L.
    Simpson, Richard J.
    Slotta, Douglas
    Smith, Richard D.
    Stein, Stephen E.
    Tabb, David L.
    Tagle, Danilo
    Yates, John R., III
    Rodriguez, Henry
    PROTEOMICS CLINICAL APPLICATIONS, 2011, 5 (11-12) : 580 - 589
  • [10] Recommendations for Mass Spectrometry Data Quality Metrics for Open Access Data (Corollary to the Amsterdam Principles)
    Kinsinger, Christopher R.
    Apffel, James
    Baker, Mark
    Bian, Xiaopeng
    Borchers, Christoph H.
    Bradshaw, Ralph
    Brusniak, Mi-Youn
    Chan, Daniel W.
    Deutsch, Eric W.
    Domon, Bruno
    Gorman, Jeff
    Grimm, Rudolf
    Hancock, William
    Hermjakob, Henning
    Horn, David
    Hunter, Christie
    Kolar, Patrik
    Kraus, Hans-Joachim
    Langen, Hanno
    Linding, Rune
    Moritz, Robert L.
    Omenn, Gilbert S.
    Orlando, Ron
    Pandey, Akhilesh
    Ping, Peipei
    Rahbar, Amir
    Rivers, Robert
    Seymour, Sean L.
    Simpson, Richard J.
    Slotta, Douglas
    Smith, Richard D.
    Stein, Stephen E.
    Tabb, David L.
    Tagle, Danilo
    Yates, John R., III
    Rodriguez, Henry
    JOURNAL OF PROTEOME RESEARCH, 2012, 11 (02) : 1412 - 1419