Quality Assessment and Biases in Reused Data

被引:3
作者
Fernandez-Ardevo, Mireia [1 ,2 ]
Rosales, Andrea [1 ,2 ]
机构
[1] Univ Oberta Catalunya UOC, Fac Informat & Commun Sci, Barcelona, Catalonia, Spain
[2] Univ Oberta Catalunya UOC, IN3 Internet Interdisciplinary Inst, Barcelona, Catalonia, Spain
关键词
data quality; data biases; reused data; reused traces; open data; online behavioral advertising;
D O I
10.1177/00027642221144855
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
This article investigates digital and non-digital traces reused beyond the context of creation. A central idea of this article is that no (reused) dataset is perfect. Therefore, data quality assessment becomes essential to determine if a given dataset is "good enough" to be used to fulfill the users' goals. Biases, a possible source of discrimination, have become a relevant data challenge. Consequently, it is appropriate to analyze whether quality assessment indicators provide information on potential biases in the dataset. We use examples representing two opposing sides regarding data access to reflect on the relationship between quality and bias. First, the European Union open data portal fosters the democratization of data and expects users to manipulate the databases directly to perform their analyses. Second, online behavioral advertising systems offer individualized promotional services but do not share the datasets supporting their design. Quality assessment is socially constructed, as there is not a universal definition but a set of quality dimensions, which might change for each professional context. From the users' perspective, trust/credibility stands out as a relevant quality dimension in the two analyzed cases. Results show that quality indicators (whatever they are) provide limited information on potential biases. We suggest that data literacy is most needed among both open data users and clients of behavioral advertising systems. Notably, users must (be able to) understand the limitations of datasets for an optimal and bias-free interpretation of results and decision-making.
引用
收藏
页码:696 / 710
页数:15
相关论文
共 50 条
  • [41] A Practical Data Quality Assessment Method for Raw Data in Vessel Operations
    Chen, Gang
    Cai, Jie
    Rytter, Niels Gorm Maly
    Lutzen, Marie
    JOURNAL OF MARINE SCIENCE AND APPLICATION, 2023, 22 (02) : 370 - 380
  • [42] An Assessment of Adoption and Quality of Linked Data in European Open Government Data
    Ibanez, Luis-Daniel
    Millard, Ian
    Glaser, Hugh
    Simperl, Elena
    SEMANTIC WEB - ISWC 2019, PT II, 2019, 11779 : 436 - 453
  • [43] Importance of the Open Data Assessment: An Insight Into the (Meta) Data Quality Dimensions
    Slibar, Barbara
    Oreski, Dijana
    Redep, Nina Begicevic
    SAGE OPEN, 2021, 11 (02):
  • [44] Assessment for Remote Sensing Data: Accuracy of Interactive Data Quality Interpretation
    Borg, Erik
    Fichtelmann, Bernd
    Asche, Hartmut
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT II, 2011, 6783 : 366 - 375
  • [45] Investigating and Mitigating Biases in Crowdsourced Data
    Hettiachchi, Danula
    Sanderson, Mark
    Goncalves, Jorge
    Hosio, Simo
    Kazai, Gabriella
    Lease, Matthew
    Schaekermann, Mike
    Yilmaz, Emine
    CONFERENCE COMPANION PUBLICATION OF THE 2021 COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CSCW 2021 COMPANION, 2021, : 331 - 334
  • [46] Data quality and uncertainty assessment of life cycle inventory data for composites
    Balcioglu, Gulizar
    Fitzgerald, Amy M.
    Rodes, Ffion A. M.
    Allen, Stephen R.
    COMPOSITES PART B-ENGINEERING, 2025, 292
  • [47] Data Quality Assessment in the Integration Process of Linked Open Data (LOD)
    Ahmed, Hana Haj
    2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 1 - 6
  • [48] A Method of Electronic Health Data Quality Assessment: Enabling Data Provenance
    Sun, Yuling
    Lu, Tun
    Gu, Ning
    2017 IEEE 21ST INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2017, : 233 - 238
  • [49] Towards an effective user interface for data exploration, data quality assessment and data integration
    Hanlon, Rolando
    Barry, Marguerite
    Marrinan, Fergal
    O'Sullivan, Declan
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 431 - 436
  • [50] Quality of HIV Testing Data Before and After the Implementation of a National Data Quality Assessment and Feedback System
    Beltrami, John
    Wang, Guoshen
    Usman, Hussain R.
    Lin, Lillian S.
    JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2017, 23 (03) : 269 - 275