Improving fairness generalization through a sample-robust optimization method

Cited by: 6
Authors
Ferry, Julien [1]
Aivodji, Ulrich [2]
Gambs, Sebastien [3]
Huguet, Marie-Jose [1]
Siala, Mohamed [1]
Affiliations
[1] Univ Toulouse, INSA, CNRS, LAAS CNRS, Toulouse, France
[2] Ecole Technol Super, Montreal, PQ, Canada
[3] Univ Quebec Montreal, Montreal, PQ, Canada
Keywords
Supervised learning; Fairness; Generalization; Distributionally robust optimization
DOI
10.1007/s10994-022-06191-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Unwanted bias is a major concern in machine learning, raising significant ethical issues in particular when machine learning models are deployed within high-stakes decision systems. A common way to mitigate it is to integrate a statistical fairness metric into the training phase and optimize it along with accuracy. However, one of the main remaining challenges is that current approaches usually generalize poorly in terms of fairness on unseen data. We address this issue by proposing a new robustness framework for statistical fairness in machine learning. The proposed approach is inspired by the domain of distributionally robust optimization and works by ensuring fairness over a variety of samplings of the training set. Our approach can be used both to quantify the robustness of fairness and to improve it when training a model. We empirically evaluate the proposed method and show that it effectively improves fairness generalization. In addition, we propose a simple yet powerful heuristic application of our framework that can be integrated into a wide range of existing fair classification techniques to enhance fairness generalization. Our extensive empirical study using two existing fair classification methods demonstrates the efficiency and scalability of the proposed heuristic approach.
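As a rough illustration of what "fairness over a variety of samplings of the training set" could mean in practice, the sketch below measures a statistical parity gap on many random subsets of a toy training set and reports the largest value found. This is not the authors' method: the function names (`statistical_parity_gap`, `worst_case_gap`), the choice of statistical parity as the metric, the random-subset scheme, and the toy data are all assumptions made for illustration only.

```python
import random


def statistical_parity_gap(preds, groups):
    """Absolute difference in positive-prediction rates between groups 0 and 1."""
    rates = []
    for g in (0, 1):
        members = [p for p, s in zip(preds, groups) if s == g]
        if not members:
            return None  # the gap is undefined when a group is absent
        rates.append(sum(members) / len(members))
    return abs(rates[0] - rates[1])


def worst_case_gap(preds, groups, n_subsets=200, frac=0.8, seed=0):
    """Largest parity gap over the full set and n_subsets random subsets.

    Hypothetical helper: random subsets of size frac * n stand in for the
    'variety of samplings' mentioned in the abstract.
    """
    rng = random.Random(seed)
    n = len(preds)
    k = max(1, int(frac * n))
    worst = statistical_parity_gap(preds, groups)
    for _ in range(n_subsets):
        idx = rng.sample(range(n), k)
        gap = statistical_parity_gap(
            [preds[i] for i in idx], [groups[i] for i in idx]
        )
        if gap is not None and (worst is None or gap > worst):
            worst = gap
    return worst


# Toy binary predictions and a binary sensitive attribute (made-up data).
preds = [1, 1, 0, 1, 0, 0, 1, 0, 1, 0]
groups = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

full = statistical_parity_gap(preds, groups)
worst = worst_case_gap(preds, groups)
print(f"gap on full training set: {full:.3f}")
print(f"worst gap over subsets:   {worst:.3f}")
```

A large difference between the two printed values would suggest that the fairness measured on the full training set is sensitive to which examples happen to be sampled, which is the kind of fragility the abstract associates with poor fairness generalization.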
Pages: 2131-2192
Number of pages: 62