Measuring data quality with weighted metrics

被引:17
|
作者
Vaziri, Reza [1 ]
Mohsenzadeh, Mehran [1 ]
Habibi, Jafar [2 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
[2] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
data quality; information quality; metrics; weighted metrics; methodology; METHODOLOGY;
D O I
10.1080/14783363.2017.1332954
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Data quality (DQ) has been defined as 'fitness for use'. In order to measure and improve DQ, various methodologies have been defined. A DQ methodology is a set of guidelines and techniques that define a rational process to measure and improve the quality of data. In order to make DQ measurement and improvement more organised, DQ dimensions have been defined. A dimension is a single aspect of DQ, such as accuracy, completeness, timeliness, and relevancy. In order to measure dimensions, special tools have been developed. These are called metrics. In most organisations, some data are more significant than others. In other words, some data carry more 'weight'. Hence, they must play a more important role in DQ measurement. Most metrics developed so far do not take into account data weights. In this paper, new metrics based on data weights are defined in order to make them more practical. The effectiveness of the new 'weighted metrics' is tested in a case study. The case study shows that the DQ measurements by weighted metrics more closely reflect the opinion of data users.
引用
收藏
页码:708 / 720
页数:13
相关论文
共 50 条
  • [21] Measuring data abstraction quality in multiresolution visualizations
    Cui, Qingguang
    Ward, Matthew O.
    Rundensteiner, Elke A.
    Yang, Jing
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) : 709 - 716
  • [22] Measuring data quality in information systems research
    Timmerman, Yoram
    Bronselaer, Antoon
    DECISION SUPPORT SYSTEMS, 2019, 126
  • [23] Measuring Manufacturing Test Data Analysis Quality
    Burkhardt, Andrew
    Berryman, Sheila
    Brio, Ashley
    Ferkau, Susan
    Hubner, Gloria
    Lynch, Kevin
    Mittman, Susan
    Sonderer, Kathy
    2018 IEEE AUTOTESTCON, 2018, : 359 - 364
  • [24] Process-Driven Data Quality Management Through Integration of Data Quality into Existing Process ModelsApplication of Complexity-Reducing Patterns and the Impact on Complexity Metrics
    Paul Glowalla
    Ali Sunyaev
    Business & Information Systems Engineering, 2013, 5 : 433 - 448
  • [25] Transforming Digital Phenotyping Raw Data Into Actionable Biomarkers, Quality Metrics, and Data Visualizations Using Cortex Software Package: Tutorial
    Burns, James
    Chen, Kelly
    Flathers, Matthew
    Currey, Danielle
    Macrynikola, Natalia
    Vaidyam, Aditya
    Langholm, Carsten
    Barnett, Ian
    Byun, Andrew
    Lane, Erlend
    Torous, John
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [26] The data quality improvement plan: deciding on choice and sequence of data quality improvements
    Kleindienst, Dominikus
    ELECTRONIC MARKETS, 2017, 27 (04) : 387 - 398
  • [27] Process-Driven Data Quality Management Through Integration of Data Quality into Existing Process Models Application of Complexity-Reducing Patterns and the Impact on Complexity Metrics
    Glowalla, Paul
    Sunyaev, Ali
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2013, 5 (06) : 433 - 448
  • [28] Measuring data quality of geoscience datasets using data mining techniques
    Cai, Cuo
    Xie, Kunqing
    Data Science Journal, 2007, 6 (SUPPL.) : S738 - S742
  • [29] A Metrics-Driven Approach for Quality Assessment of Linked Open Data
    Behkamal, Behshid
    Kahani, Mohsen
    Bagheri, Ebrahim
    Jeremic, Zoran
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2014, 9 (02): : 64 - 79
  • [30] Measuring the Quality of Data in Electronic Health Records Aggregators
    Molina, Carlos
    Prados-Suarez, Belen
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,