Measuring data quality with weighted metrics

被引:17
|
作者
Vaziri, Reza [1 ]
Mohsenzadeh, Mehran [1 ]
Habibi, Jafar [2 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
[2] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
data quality; information quality; metrics; weighted metrics; methodology; METHODOLOGY;
D O I
10.1080/14783363.2017.1332954
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Data quality (DQ) has been defined as 'fitness for use'. In order to measure and improve DQ, various methodologies have been defined. A DQ methodology is a set of guidelines and techniques that define a rational process to measure and improve the quality of data. In order to make DQ measurement and improvement more organised, DQ dimensions have been defined. A dimension is a single aspect of DQ, such as accuracy, completeness, timeliness, and relevancy. In order to measure dimensions, special tools have been developed. These are called metrics. In most organisations, some data are more significant than others. In other words, some data carry more 'weight'. Hence, they must play a more important role in DQ measurement. Most metrics developed so far do not take into account data weights. In this paper, new metrics based on data weights are defined in order to make them more practical. The effectiveness of the new 'weighted metrics' is tested in a case study. The case study shows that the DQ measurements by weighted metrics more closely reflect the opinion of data users.
引用
收藏
页码:708 / 720
页数:13
相关论文
共 50 条
  • [41] Big Data Analytics on MANET Routing Standardization using Quality Assurance Metrics
    Kush, Ashwani
    Hwang, C. Jinshong
    Dattana, Vishal
    PROCEEDINGS OF 2016 FUTURE TECHNOLOGIES CONFERENCE (FTC), 2016, : 192 - 198
  • [42] Multidimensional Quality Metrics (MQM): A Framework for Declaring and Describing Translation Quality Metrics
    Lommel, Arle
    Uszkoreit, Hans
    Burchardt, Aljoscha
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2014, (12): : 455 - 462
  • [43] Measuring and Diffusing Data Quality in a Peer-to-Peer Architecture
    Milano, Diego
    Scannapieco, Monica
    Catarci, Tiziana
    INTERNATIONAL JOURNAL OF ENTERPRISE INFORMATION SYSTEMS, 2007, 3 (01) : 61 - 84
  • [44] MetricHaven - More Than 23,000 Metrics for Measuring Quality Attributes of Software Product Lines
    El-Sharkawy, Sascha
    Krafczyk, Adam
    Schmid, Klaus
    23RD INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE(SPLC 2019), VOL B, 2019, : 25 - 28
  • [45] Measuring Data Quality Through a Source Data Verification Audit in a Clinical Research Setting
    Houston, Lauren
    Probst, Yasmine
    Humphries, Allison
    DRIVING REFORM: DIGITAL HEALTH IS EVERYONE'S BUSINESS, 2015, 214 : 107 - 113
  • [46] A Metrics Suite for Measuring Reusability of Learning Objects
    Noor, Siti Fadzilah Mat
    Yusof, Norazah
    Hashim, Siti Zaiton Mohd
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 961 - +
  • [47] Measuring sustainable diets: nutrition and ecosystem metrics
    Macdiarmid, Jennie I.
    ANNALS OF NUTRITION AND METABOLISM, 2023, 79 : 276 - 276
  • [48] A Metrics Suite for Measuring Indirect Coupling Complexity
    J. Navas-Su
    A. Gonzalez-Torres
    M. Hernandez-Vasquez
    J. Solano-Cordero
    F. Hernandez-Castro
    A. Bener
    Programming and Computer Software, 2023, 49 : 735 - 761
  • [49] Measuring the Crowd - A Preliminary Taxonomy of Crowdsourcing Metrics
    Cullina, Eoin
    Conboy, Kieran
    Morgan, Lorraine
    PROCEEDINGS OF THE 11TH INTERNATIONAL SYMPOSIUM ON OPEN COLLABORATION, 2015, : B1 - +
  • [50] A Metrics Suite for Measuring Indirect Coupling Complexity
    Navas-Su, J.
    Gonzalez-Torres, A.
    Hernandez-Vasquez, M.
    Solano-Cordero, J.
    Hernandez-Castro, F.
    Bener, A.
    PROGRAMMING AND COMPUTER SOFTWARE, 2023, 49 (08) : 735 - 761