On the Impact of Data Quality on Image Classification Fairness

被引:1
作者
Barry, Aki [1 ]
Han, Lei [1 ]
Demartini, Gianluca [1 ]
机构
[1] Univ Queensland, Brisbane, Qld, Australia
来源
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年
基金
澳大利亚研究理事会; 瑞士国家科学基金会;
关键词
Fairness; Machine Learning; Data Quality;
D O I
10.1145/3539618.3592031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the proliferation of algorithmic decision-making, increased scrutiny has been placed on these systems. This paper explores the relationship between the quality of the training data and the overall fairness of the models trained with such data in the context of supervised classification. We measure key fairness metrics across a range of algorithms over multiple image classification datasets that have a varying level of noise in both the labels and the training data itself. We describe noise in the labels as inaccuracies in the labelling of the data in the training set and noise in the data as distortions in the data, also in the training set. By adding noise to the original datasets, we can explore the relationship between the quality of the training data and the fairness of the output of the models trained on that data.
引用
收藏
页码:2225 / 2229
页数:5
相关论文
共 28 条
  • [1] Angwin J., 2016, PROPUBLICA
  • [2] [Anonymous], 2013, Queue, DOI [DOI 10.1145/2447976.2447990, 10.1145/2447976.2447990, DOI 10.1145/2460276.2460278]
  • [3] Bacham Dinesh, 2017, MOODYS ANAL RISK PER
  • [4] Big Data's Disparate Impact
    Barocas, Solon
    Selbst, Andrew D.
    [J]. CALIFORNIA LAW REVIEW, 2016, 104 (03) : 671 - 732
  • [5] Bechavod Yahav, 2017, PENALIZING UNFAIRNES, DOI [10.48550/ARXIV.1707.00044, DOI 10.48550/ARXIV.1707.00044]
  • [6] Blake R., 2011, Journal of Data and Information Quality (JDIQ), V2, P1, DOI [10.1145/1891879.1891881, DOI 10.1145/1891879.1891881]
  • [7] Budach L., 2022, P AAAI 2022 SPRING S, DOI DOI 10.48550/ARXIV.2207.14529
  • [8] Caton S., 2020, Fairness in machine learning: A survey
  • [9] Chouldechova A., 2018, ARXIV181008810
  • [10] Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments
    Chouldechova, Alexandra
    [J]. BIG DATA, 2017, 5 (02) : 153 - 163