Efficient fair principal component analysis

被引:6
作者
Kamani, Mohammad Mahdi [1 ]
Haddadpour, Farzin [2 ]
Forsati, Rana [3 ]
Mahdavi, Mehrdad [2 ]
机构
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
[2] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
[3] Microsoft Bing, Bellevue, WA USA
关键词
Fairness; Pareto efficient; Dimension reduction; PCA;
D O I
10.1007/s10994-021-06100-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has been shown that dimension reduction methods such as Principal Component Analysis (PCA) may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly compromising the reconstruction loss. Theoretically, we provide sufficient conditions that the solution of the proposed algorithm belongs to the Pareto frontier for all sensitive groups; thereby, the optimal trade-off between overall reconstruction loss and fairness constraints is guaranteed. We also provide the convergence analysis of our algorithm and show its efficacy through empirical studies on different datasets, which demonstrates superior performance in comparison with state-of-the-art algorithms. The proposed fairness-aware PCA algorithm can be efficiently generalized to multiple group sensitive features and effectively reduce the unfairness decisions in downstream tasks such as classification.
引用
收藏
页码:3671 / 3702
页数:32
相关论文
共 42 条
  • [1] Agarwal A., 2018, P 35 INT C MACHINE L, P60
  • [2] [Anonymous], 2013, Advances of Neural Information Processing Systems
  • [3] [Anonymous], 2018, ARXIV180203765
  • [4] Barocas Solon, 2017, Fairness in machine learning, V1
  • [5] Blum A., 2019, ARXIV191201094
  • [6] Three naive Bayes approaches for discrimination-free classification
    Calders, Toon
    Verwer, Sicco
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 21 (02) : 277 - 292
  • [7] Caldwell Tracey, 2009, Information World Review, P13
  • [8] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [9] Creager E., 2019, ARXIV190602589
  • [10] Cunningham JP, 2015, J MACH LEARN RES, V16, P2859