FlipTest: Fairness Testing via Optimal Transport

Cited by: 41
Authors
Black, Emily [1]
Yeom, Samuel [1]
Fredrikson, Matt [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Source
FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY | 2020
Funding
U.S. National Science Foundation
Keywords
fairness; machine learning; optimal transport; disparate impact
DOI
10.1145/3351095.3372845
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present FlipTest, a black-box technique for uncovering discrimination in classifiers. FlipTest is motivated by the intuitive question: had an individual been of a different protected status, would the model have treated them differently? Rather than relying on causal information to answer this question, FlipTest leverages optimal transport to match individuals in different protected groups, creating similar pairs of in-distribution samples. We show how to use these instances to detect discrimination by constructing a flipset: the set of individuals whose classifier output changes post-translation, which corresponds to the set of people who may be harmed because of their group membership. To shed light on why the model treats a given subgroup differently, FlipTest produces a transparency report: a ranking of features that are most associated with the model's behavior on the flipset. Evaluating the approach on three case studies, we show that this provides a computationally inexpensive way to identify subgroups that may be harmed by model discrimination, including in cases where the model satisfies group fairness criteria.
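The flipset idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes two equal-sized, uniformly weighted groups, for which optimal transport reduces to a minimum-cost one-to-one matching, and it solves that matching by brute force, which is only feasible for a handful of points. The names `optimal_matching`, `flipset`, `toy_model`, and the toy feature values are all hypothetical.

```python
from itertools import permutations


def squared_distance(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def optimal_matching(source, target):
    """Return matching[i] = index in target paired with source[i].

    For equal-sized, uniformly weighted samples, exact optimal transport
    is a minimum-cost perfect matching. Brute force over permutations is
    used here purely for illustration (assumption: tiny inputs only).
    """
    n = len(source)
    if n != len(target):
        raise ValueError("this sketch assumes equal-sized groups")

    def total_cost(perm):
        return sum(squared_distance(source[i], target[perm[i]]) for i in range(n))

    return list(min(permutations(range(n)), key=total_cost))


def flipset(model, source, target, matching):
    """Indices of source individuals whose model output differs from
    the output on their matched counterpart in the other group."""
    return [i for i, j in enumerate(matching) if model(source[i]) != model(target[j])]


# Hypothetical toy data: two features per individual, one list per protected group.
group_a = [(1.0, 2.0), (2.0, 1.0), (3.0, 3.0)]
group_b = [(2.2, 1.1), (3.1, 3.5), (1.1, 2.6)]


def toy_model(x):
    """Hypothetical black-box classifier: accept if the feature sum reaches 3.5."""
    return int(x[0] + x[1] >= 3.5)


matching = optimal_matching(group_a, group_b)
flips = flipset(toy_model, group_a, group_b, matching)
print("matching:", matching)
print("flipset indices in group_a:", flips)
```

At realistic sample sizes the brute-force search would need to be replaced by an assignment solver such as `scipy.optimize.linear_sum_assignment`, or by an approximate transport map.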
Pages: 111-121
Page count: 11