Quality Assessment and Biases in Reused Data

被引:3
|
作者
Fernandez-Ardevo, Mireia [1 ,2 ]
Rosales, Andrea [1 ,2 ]
机构
[1] Univ Oberta Catalunya UOC, Fac Informat & Commun Sci, Barcelona, Catalonia, Spain
[2] Univ Oberta Catalunya UOC, IN3 Internet Interdisciplinary Inst, Barcelona, Catalonia, Spain
关键词
data quality; data biases; reused data; reused traces; open data; online behavioral advertising;
D O I
10.1177/00027642221144855
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
This article investigates digital and non-digital traces reused beyond the context of creation. A central idea of this article is that no (reused) dataset is perfect. Therefore, data quality assessment becomes essential to determine if a given dataset is "good enough" to be used to fulfill the users' goals. Biases, a possible source of discrimination, have become a relevant data challenge. Consequently, it is appropriate to analyze whether quality assessment indicators provide information on potential biases in the dataset. We use examples representing two opposing sides regarding data access to reflect on the relationship between quality and bias. First, the European Union open data portal fosters the democratization of data and expects users to manipulate the databases directly to perform their analyses. Second, online behavioral advertising systems offer individualized promotional services but do not share the datasets supporting their design. Quality assessment is socially constructed, as there is not a universal definition but a set of quality dimensions, which might change for each professional context. From the users' perspective, trust/credibility stands out as a relevant quality dimension in the two analyzed cases. Results show that quality indicators (whatever they are) provide limited information on potential biases. We suggest that data literacy is most needed among both open data users and clients of behavioral advertising systems. Notably, users must (be able to) understand the limitations of datasets for an optimal and bias-free interpretation of results and decision-making.
引用
收藏
页码:696 / 710
页数:15
相关论文
共 50 条
  • [31] A Framework for Linked Data Fusion and Quality Assessment
    Nahari, Mohammad Khodizadeh
    Ghadiri, Nasser
    Jafarifard, Zahra
    Dastjerdi, Ahmad Baraani
    Sack, Joerg R.
    2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2017, : 67 - 72
  • [32] Systematic assessment and improvement of medical data quality
    Jacke, C. O.
    Kalder, M.
    Koller, M.
    Wagner, U.
    Albert, U. S.
    BUNDESGESUNDHEITSBLATT-GESUNDHEITSFORSCHUNG-GESUNDHEITSSCHUTZ, 2012, 55 (11-12) : 1495 - 1503
  • [33] Crowd and Community Sourced Data Quality Assessment
    Jolivet, Laurence
    Olteanu-Raimond, Ana-Maria
    ADVANCES IN CARTOGRAPHY AND GISCIENCE, 2017, : 47 - 60
  • [34] DATA QUALITY ASSESSMENT FOR MARITIME SITUATION AWARENESS
    Iphar, C.
    Napoli, A.
    Ray, C.
    ISPRS GEOSPATIAL WEEK 2015, 2015, II-3 (W5): : 291 - 296
  • [35] Data quality assessment in hydrological information systems
    Li Chao
    Zhou Hui
    Zhou Xiaofeng
    JOURNAL OF HYDROINFORMATICS, 2015, 17 (04) : 640 - 661
  • [36] Metadata-based data quality assessment
    Aljumaili, Mustafa
    Karim, Ramin
    Tretten, Phillip
    VINE JOURNAL OF INFORMATION AND KNOWLEDGE MANAGEMENT SYSTEMS, 2016, 46 (02) : 232 - 250
  • [37] Dual assessment of data quality in customer databases
    Even, Adir
    Shankaranarayanan, G.
    Journal of Data and Information Quality, 2009, 1 (03)
  • [38] Data Quality Assessment in Smart Manufacturing: A Review
    Peixoto, Teresa
    Oliveira, Bruno
    Oliveira, Oscar
    Ribeiro, Fillipe
    SYSTEMS, 2025, 13 (04):
  • [39] Beyond Data Quality: The Assessment of Data Utilization in Indonesian Telecommunication Industry
    Lubis, Muharman
    Raafi, Engla
    Prayogo, Sendy
    INTELLIGENT SUSTAINABLE SYSTEMS, WORLDS4 2022, VOL 2, 2023, 579 : 237 - 246
  • [40] Data Quality Assessment for Comparative Effectiveness Research in Distributed Data Networks
    Brown, Jeffrey S.
    Kahn, Michael
    Toh, Sengwee
    MEDICAL CARE, 2013, 51 (08) : S22 - S29