Efficient fair principal component analysis

被引:0
作者
Mohammad Mahdi Kamani
Farzin Haddadpour
Rana Forsati
Mehrdad Mahdavi
机构
[1] The Pennsylvania State University,College of Information Sciences and Technology
[2] The Pennsylvania State University,School of Electrical Engineering and Computer Science
[3] Microsoft Bing,undefined
来源
Machine Learning | 2022年 / 111卷
关键词
Fairness; Pareto efficient; Dimension reduction; PCA;
D O I
暂无
中图分类号
学科分类号
摘要
It has been shown that dimension reduction methods such as Principal Component Analysis (PCA) may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly compromising the reconstruction loss. Theoretically, we provide sufficient conditions that the solution of the proposed algorithm belongs to the Pareto frontier for all sensitive groups; thereby, the optimal trade-off between overall reconstruction loss and fairness constraints is guaranteed. We also provide the convergence analysis of our algorithm and show its efficacy through empirical studies on different datasets, which demonstrates superior performance in comparison with state-of-the-art algorithms. The proposed fairness-aware PCA algorithm can be efficiently generalized to multiple group sensitive features and effectively reduce the unfairness decisions in downstream tasks such as classification.
引用
收藏
页码:3671 / 3702
页数:31
相关论文
共 18 条
[1]  
Calders T(2010)Three Naive Bayes approaches for discrimination-free classification Data Mining and Knowledge Discovery 21 277-292
[2]  
Verwer S(1995)Support-vector networks Machine Learning 20(3) 273-297
[3]  
Cortes C(2015)Linear dimensionality reduction: Survey, insights, and generalizations The Journal of Machine Learning Research 16 2859-2900
[4]  
Vapnik V(1997)A closer look at drawbacks of minimizing weighted sums of objectives for pareto set generation in multicriteria optimization problems Structural Optimization 14 63-69
[5]  
Cunningham JP(2006)A discussion of scalarization techniques for multiple objective integer programming Annals of Operations Research 147 343-360
[6]  
Ghahramani Z(2000)Steepest descent methods for multicriteria optimization Mathematical Methods of Operations Research 51 479-494
[7]  
Das I(2019)Eliminating latent discrimination: Train then mask Proceedings of the AAAI Conference on Artificial Intelligence 33 3672-3680
[8]  
Dennis JE(2018)Skeleton matching with applications in severe weather detection Applied Soft Computing 70 1154-1166
[9]  
Ehrgott M(undefined)undefined undefined undefined undefined-undefined
[10]  
Fliege J(undefined)undefined undefined undefined undefined-undefined