On the Impact of Data Quality on Image Classification Fairness

被引：1

作者：

Barry, Aki ^{[1
]}

Han, Lei ^{[1
]}

Demartini, Gianluca ^{[1
]}

机构：

[1] Univ Queensland, Brisbane, Qld, Australia

来源：

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年

基金：

澳大利亚研究理事会; 瑞士国家科学基金会;

关键词：

Fairness; Machine Learning; Data Quality;

D O I：

10.1145/3539618.3592031

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the proliferation of algorithmic decision-making, increased scrutiny has been placed on these systems. This paper explores the relationship between the quality of the training data and the overall fairness of the models trained with such data in the context of supervised classification. We measure key fairness metrics across a range of algorithms over multiple image classification datasets that have a varying level of noise in both the labels and the training data itself. We describe noise in the labels as inaccuracies in the labelling of the data in the training set and noise in the data as distortions in the data, also in the training set. By adding noise to the original datasets, we can explore the relationship between the quality of the training data and the fairness of the output of the models trained on that data.

引用

页码：2225 / 2229

页数：5

共 28 条

[1] Angwin J., 2016, PROPUBLICA
[2] [Anonymous], 2013, Queue, DOI [DOI 10.1145/2447976.2447990, 10.1145/2447976.2447990, DOI 10.1145/2460276.2460278]
[3] Bacham Dinesh, 2017, MOODYS ANAL RISK PER
[4] Big Data's Disparate Impact
Barocas, Solon
Selbst, Andrew D.
[J]. CALIFORNIA LAW REVIEW, 2016, 104 (03) : 671 - 732
[5] Bechavod Yahav, 2017, PENALIZING UNFAIRNES, DOI [10.48550/ARXIV.1707.00044, DOI 10.48550/ARXIV.1707.00044]
[6] Blake R., 2011, Journal of Data and Information Quality (JDIQ), V2, P1, DOI [10.1145/1891879.1891881, DOI 10.1145/1891879.1891881]
[7] Budach L., 2022, P AAAI 2022 SPRING S, DOI DOI 10.48550/ARXIV.2207.14529
[8] Caton S., 2020, Fairness in machine learning: A survey
[9] Chouldechova A., 2018, ARXIV181008810
[10] Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments
Chouldechova, Alexandra
[J]. BIG DATA, 2017, 5 (02) : 153 - 163

← 1 2 3 →