Measuring and Mitigating Unintended Bias in Text Classification

被引：322

作者：

Dixon, Lucas ^{[1
]}

Li, John ^{[1
]}

Sorensen, Jeffrey ^{[1
]}

Thain, Nithum ^{[1
]}

Vasserman, Lucy ^{[1
]}

机构：

[1] Jigsaw, Munich, Germany

来源：

PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18) | 2018年

关键词：

D O I：

10.1145/3278721.3278729

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce and illustrate a new approach to measuring and mitigating unintended bias in machine learning models. Our definition of unintended bias is parameterized by a test set and a subset of input features. We illustrate how this can be used to evaluate text classifiers using a synthetic test set and a public corpus of comments annotated for toxicity from Wikipedia Talk pages. We also demonstrate how imbalances in training data can lead to unintended bias in the resulting models, and therefore potentially unfair applications. We use a set of common demographic identity terms as the subset of input features on which we measure bias. This technique permits analysis in the common scenario where demographic information on authors and readers is unavailable, so that bias mitigation must focus on the content of the text itself. The mitigation method we introduce is an unsupervised approach based on balancing the training dataset. We demonstrate that this approach reduces the unintended bias without compromising overall model quality.

引用

页码：67 / 73

页数：7

共 13 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2017, ABS170700061 CORR

[3]

[Anonymous], 2016, CoRR, abs/1607.06520

[4]

[Anonymous], 2016, ACL

[5]

[Anonymous], 2016, ABS161002413 CORR

[6]

[Anonymous], 2017, ABS170700075 CORR

[7]

[Anonymous], PATTERN RECOGN LETT

[8]

[Anonymous], ETHICS NATURAL LANGU

[9]

Chollet F., 2015, about us

[10] Certifying and Removing Disparate Impact [J].

Feldman, Michael ;

Friedler, Sorelle A. ;

Moeller, John ;

Scheidegger, Carlos ;

Venkatasubramanian, Suresh .

KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :259-268

← 1 2 →