Automated Directed Fairness Testing

被引：112

作者：

Udeshi, Sakshi ^{[1
]}

Arora, Pryanshu ^{[2
]}

Chattopadhyay, Sudipta ^{[1
]}

机构：

[1] Singapore Univ Tech & Design, Singapore, Singapore

[2] BITS Pilani, Pilani, Rajasthan, India

来源：

PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18) | 2018年

关键词：

Software Fairness; Directed Testing; Machine Learning;

D O I：

10.1145/3238147.3238165

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Fairness is a critical trait in decision making. As machine-learning models are increasingly being used in sensitive application domains (e.g. education and employment) for decision making, it is crucial that the decisions computed by such models are free of unintended bias. But how can we automatically validate the fairness of arbitrary machine-learning models? For a given machine-learning model and a set of sensitive input parameters, our AEQVITAS approach automatically discovers discriminatory inputs that highlight fairness violation. At the core of AEQVITAS are three novel strategies to employ probabilistic search over the input space with the objective of uncovering fairness violation. Our AEQVITAS approach leverages inherent robustness property in common machine-learning models to design and implement scalable test generation methodologies. An appealing feature of our generated test inputs is that they can be systematically added to the training set of the underlying model and improve its fairness. To this end, we design a fully automated module that guarantees to improve the fairness of the model. We implemented AEQVITAS and we have evaluated it on six state-of-the-art classifiers. Our subjects also include a classifier that was designed with fairness in mind. We show that AEQVITAS effectively generates inputs to uncover fairness violation in all the subject classifiers and systematically improves the fairness of respective models using the generated test inputs. In our evaluation, AEQVITAS generates up to 70% discriminatory inputs (w.r.t. the total number of inputs generated) and leverages these inputs to improve the fairness up to 94%.

引用

页码：98 / 108

页数：11

共 19 条

[11]

Kamishima Toshihiro, 2012, Machine Learning and Knowledge Discovery in Databases. Proceedings of the European Conference (ECML PKDD 2012), P35, DOI 10.1007/978-3-642-33486-3_3

[12] Application of majority voting to pattern recognition: An analysis of its behavior and performance [J].

Lam, L ;

Suen, CY .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1997, 27 (05) :553-568

[13] Search-based software test data generation: a survey [J].

McMinn, P .

SOFTWARE TESTING VERIFICATION & RELIABILITY, 2004, 14 (02) :105-156

[14] Practical Black-Box Attacks against Machine Learning [J].

Papernot, Nicolas ;

McDaniel, Patrick ;

Goodfellow, Ian ;

Jha, Somesh ;

Celik, Z. Berkay ;

Swami, Ananthram .

PROCEEDINGS OF THE 2017 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIA CCS'17), 2017, :506-519

[15]

Papernot N, 2016, IEEE MILIT COMMUN C, P49, DOI 10.1109/MILCOM.2016.7795300

[16] DeepXplore: Automated Whitebox Testing of Deep Learning Systems [J].

Pei, Kexin ;

Cao, Yinzhi ;

Yang, Junfeng ;

Jana, Suman .

PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, :1-18

[17]

University of Michigan, NOND POL NOT

[18] Feature-Guided Black-Box Safety Testing of Deep Neural Networks [J].

Wicker, Matthew ;

Huang, Xiaowei ;

Kwiatkowska, Marta .

TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, TACAS 2018, PT I, 2018, 10805 :408-426

[19]

Zafar MB, 2017, PR MACH LEARN RES, V54, P962

← 1 2 →