Fairness Testing: Testing Software for Discrimination

被引:227
作者
Galhotra, Sainyam [1 ]
Brun, Yuriy [1 ]
Meliou, Alexandra [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
来源
ESEC/FSE 2017: PROCEEDINGS OF THE 2017 11TH JOINT MEETING ON FOUNDATIONS OF SOFTWARE ENGINEERING | 2017年
基金
美国国家科学基金会;
关键词
Discrimination testing; fairness testing; software bias; testing; RELIABILITY; CAUSALITY;
D O I
10.1145/3106237.3106277
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper defines software fairness and discrimination and develops a testing-based method for measuring if and how much software discriminates, focusing on causality in discriminatory behavior. Evidence of software discrimination has been found in modern software systems that recommend criminal sentences, grant access to financial products, and determine who is allowed to participate in promotions. Our approach, Themis, generates efficient test suites to measure discrimination. Given a schema describing valid system inputs, Themis generates discrimination tests automatically and does not require an oracle. We evaluate Themis on 20 software systems, 12 of which come from prior work with explicit focus on avoiding discrimination. We find that (1) Themis is effective at discovering software discrimination, (2) state-of-the-art techniques for removing discrimination from algorithms fail in many situations, at times discriminating against as much as 98% of an input subdomain, (3) Themis optimizations are effective at producing efficient test suites for measuring discrimination, and (4) Themis is more efficient on systems that exhibit more discrimination. We thus demonstrate that fairness testing is a critical aspect of the software development cycle in domains with possible discrimination and provide initial tools for measuring software discrimination.
引用
收藏
页码:498 / 510
页数:13
相关论文
共 89 条
[1]  
Andrews J.H., 2007, Proceedings of the Twenty-second IEEE/ACM International Conference on Automated Software Engineering. ASE'07, P144, DOI 10.1145/1321631.1321654
[2]  
[Anonymous], NPR ALL THINGS CONSI
[3]  
[Anonymous], 2016, BIG DAT REP ALG SYST
[4]  
[Anonymous], ABS12070016 CORR
[5]  
[Anonymous], 2016, ETHICS DATA ANALYTIC
[6]  
[Anonymous], CAPITAL TIMES 0713
[7]  
[Anonymous], ABS161104967 CORR
[8]  
[Anonymous], 2017, IEEE EUR S SEC PRIV
[9]  
[Anonymous], INT C MAN DAT SIGMOD
[10]  
[Anonymous], 2011, TAPP