Optimized Pre-Processing for Discrimination Prevention

被引:0
作者
Calmon, Flavio P. [1 ]
Wei, Dennis [2 ]
Vinzamuri, Bhanukiran [2 ]
Ramamurthy, Karthikeyan Natesan [2 ]
Varshney, Kush R. [2 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] IBM Res AI, Yorktown Hts, NY USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017年 / 30卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-discrimination is a recognized objective in algorithmic decision making. In this paper, we introduce a novel probabilistic formulation of data pre-processing for reducing discrimination. We propose a convex optimization for learning a data transformation with three goals: controlling discrimination, limiting distortion in individual data samples, and preserving utility. We characterize the impact of limited sample size in accomplishing this objective. Two instances of the proposed optimization are applied to datasets, including one on real-world criminal recidivism. Results show that discrimination can be greatly reduced at a small cost in classification accuracy.
引用
收藏
页数:10
相关论文
共 28 条
[11]  
Diamond S, 2016, J MACH LEARN RES, V17
[12]  
Dwork C, 2012, P 3 INN THEOR COMP S
[13]   Certifying and Removing Disparate Impact [J].
Feldman, Michael ;
Friedler, Sorelle A. ;
Moeller, John ;
Scheidegger, Carlos ;
Venkatasubramanian, Suresh .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :259-268
[14]  
Friedler SA, 2016, IM POSSIBILITY FAIRN, Vabs/1609.07236
[15]   A Methodology for Direct and Indirect Discrimination Prevention in Data Mining [J].
Hajian, Sara ;
Domingo-Ferrer, Josep .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (07) :1445-1459
[16]  
Hardt M, 2016, ADV NEUR IN, V29
[17]  
Johnson K.D., 2016, ARXIV160800528
[18]   Data preprocessing techniques for classification without discrimination [J].
Kamiran, Faisal ;
Calders, Toon .
KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (01) :1-33
[19]  
Kamishima T., 2011, 2011 IEEE International Conference on Data Mining Workshops, P643, DOI 10.1109/ICDMW.2011.83
[20]  
Kleinberg Jon, 2017, INNOVATIONS THEORETI