Model-based and actual independence for fairness-aware classification

被引:0
作者
Toshihiro Kamishima
Shotaro Akaho
Hideki Asoh
Jun Sakuma
机构
[1] National Institute of Advanced Industrial Science and Technology (AIST),
[2] University of Tsukuba,undefined
[3] RIKEN Center for Advanced Intelligence Project,undefined
来源
Data Mining and Knowledge Discovery | 2018年 / 32卷
关键词
Fairness; Discrimination; Classification; Cost-sensitive learning;
D O I
暂无
中图分类号
学科分类号
摘要
The goal of fairness-aware classification is to categorize data while taking into account potential issues of fairness, discrimination, neutrality, and/or independence. For example, when applying data mining technologies to university admissions, admission criteria must be non-discriminatory and fair with regard to sensitive features, such as gender or race. In this context, such fairness can be formalized as statistical independence between classification results and sensitive features. The main purpose of this paper is to analyze this formal fairness in order to achieve better trade-offs between fairness and prediction accuracy, which is important for applying fairness-aware classifiers in practical use. We focus on a fairness-aware classifier, Calders and Verwer’s two-naive-Bayes (CV2NB) method, which has been shown to be superior to other classifiers in terms of fairness. We hypothesize that this superiority is due to the difference in types of independence. That is, because CV2NB achieves actual independence, rather than satisfying model-based independence like the other classifiers, it can account for model bias and a deterministic decision rule. We empirically validate this hypothesis by modifying two fairness-aware classifiers, a prejudice remover method and a reject option-based classification (ROC) method, so as to satisfy actual independence. The fairness of these two modified methods was drastically improved, showing the importance of maintaining actual independence, rather than model-based independence. We additionally extend an approach adopted in the ROC method so as to make it applicable to classifiers other than those with generative models, such as SVMs.
引用
收藏
页码:258 / 286
页数:28
相关论文
共 10 条
[1]  
Calders T(2010)Three naive Bayes approaches for discrimination-free classification Data Min Knowl Discov 21 277-292
[2]  
Verwer S(2013)A methodology for direct and indirect discrimination prevention in data mining IEEE Trans Knowl Data Eng 25 1445-1459
[3]  
Hajian S(2012)Data preprocessing techniques for classification without discrimination Knowl Inf Syst 33 1-33
[4]  
Domingo-Ferrer J(2013)Quantifying explainable discrimination and removing illegal discrimination in automated decision making Knowl Inf Syst 35 613-644
[5]  
Kamiran F(2013)Discrimination in online ad delivery Commun ACM 56 44-54
[6]  
Calders T(undefined)undefined undefined undefined undefined-undefined
[7]  
Kamiran F(undefined)undefined undefined undefined undefined-undefined
[8]  
Žliobaitė I(undefined)undefined undefined undefined undefined-undefined
[9]  
Calders T(undefined)undefined undefined undefined undefined-undefined
[10]  
Sweeney L(undefined)undefined undefined undefined undefined-undefined