Decision Theory for Discrimination-aware Classification

被引:162
作者
Kamiran, Faisal [1 ]
Karim, Asim [2 ]
Zhang, Xiangliang [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia
[2] Lahore Univ Management Sci, Lahore, Pakistan
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
关键词
D O I
10.1109/ICDM.2012.45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social discrimination (e.g., against females) arising from data mining techniques is a growing concern worldwide. In recent years, several methods have been proposed for making classifiers learned over discriminatory data discrimination-aware. However, these methods suffer from two major shortcomings: (1) They require either modifying the discriminatory data or tweaking a specific classification algorithm and (2) They are not flexible w.r.t. discrimination control and multiple sensitive attribute handling. In this paper, we present two solutions for discrimination-aware classification that neither require data modification nor classifier tweaking. Our first and second solutions exploit, respectively, the reject option of probabilistic classifier(s) and the disagreement region of general classifier ensembles to reduce discrimination. We relate both solutions with decision theory for better understanding of the process. Our experiments using real-world datasets demonstrate that our solutions outperform existing state-of-the-art methods, especially at low discrimination which is a significant advantage. The superior performance coupled with flexible control over discrimination and easy applicability to multiple sensitive attributes makes our solutions an important step forward in practical discrimination-aware classification.
引用
收藏
页码:924 / 929
页数:6
相关论文
共 10 条
  • [1] [Anonymous], 2011, ICDMW
  • [2] [Anonymous], 2008, 14 ACM SIGKDD INT C
  • [3] [Anonymous], 2007, Uci machine learning repository
  • [4] Three naive Bayes approaches for discrimination-free classification
    Calders, Toon
    Verwer, Sicco
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 21 (02) : 277 - 292
  • [5] Hajian S., 2012, TKDE IN PRESS
  • [6] Kamiran Faisal, 2010, Proceedings 2010 10th IEEE International Conference on Data Mining (ICDM 2010), P869, DOI 10.1109/ICDM.2010.50
  • [7] Data preprocessing techniques for classification without discrimination
    Kamiran, Faisal
    Calders, Toon
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (01) : 1 - 33
  • [8] Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy
    Kuncheva, LI
    Whitaker, CJ
    [J]. MACHINE LEARNING, 2003, 51 (02) : 181 - 207
  • [9] Luong Binh Thanh, 2011, P 17 ACM SIGKDD INT, P502
  • [10] Zliobaite I., 2011, Proceedings of the 2011 IEEE 11th International Conference on Data Mining (ICDM 2011), P992, DOI 10.1109/ICDM.2011.72