Model Debiasing via Gradient-based Explanation on Representation

被引:0
|
作者
Zhang, Jindi [1 ]
Wang, Luning [1 ]
Su, Dan [3 ]
Huang, Yongxiang [1 ]
Cao, Caleb Chen [2 ]
Chen, Lei [2 ]
机构
[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci, Hong Kong, Peoples R China
[3] NVIDIA Res, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023 | 2023年
关键词
fairness; model debiasing; representation learning; gradient-based explanation;
D O I
10.1145/3600211.3604668
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially for unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is to, first, leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our frameworkworks with both disentangled and non-disentangled representation learning methods and achieves better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [21] Model-reduced gradient-based history matching
    Kaleta, Malgorzata P.
    Hanea, Remus G.
    Heemink, Arnold W.
    Jansen, Jan-Dirk
    COMPUTATIONAL GEOSCIENCES, 2011, 15 (01) : 135 - 153
  • [22] Gradient-based adaptation of continuous dynamic model structures
    La Cava, William G.
    Danai, Kourosh
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2016, 47 (01) : 249 - 263
  • [23] A Gradient-Based Constitutive Model for Shape Memory Alloys
    Tabesh M.
    Boyd J.
    Lagoudas D.
    Boyd, James (jgboyd@tamu.edu), 1600, Springer (03): : 84 - 108
  • [24] Gradient-based explanation for non-linear non-parametric dimensionality reduction
    Corbugy, Sacha
    Marion, Rebecca
    Frenay, Benoit
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 3690 - 3718
  • [25] Gradient-Based Algorithms for Convex Discrete Optimization via Simulation
    Zhang, Haixiang
    Zheng, Zeyu
    Lavaei, Javad
    OPERATIONS RESEARCH, 2023, 71 (05) : 1815 - 1834
  • [26] Evaluating gradient-based explanation methods for neural network ECG analysis using heatmaps
    Storas, Andrea Marheim
    Maeland, Steffen
    Isaksen, Jonas L.
    Hicks, Steven Alexander
    Thambawita, Vajira
    Graff, Claus
    Hammer, Hugo Lewi
    Halvorsen, Pal
    Riegler, Michael Alexander
    Kanters, Jorgen K.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 32 (01) : 79 - 88
  • [27] Sparsifying the resolvent forcing mode via gradient-based optimisation
    Skene, Calum S.
    Yeh, Chi-An
    Schmid, Peter J.
    Taira, Kunihiko
    JOURNAL OF FLUID MECHANICS, 2022, 944
  • [28] Relieving popularity bias in recommendation via debiasing representation enhancement
    Zhang, Junsan
    Wu, Sini
    Wang, Te
    Ding, Fengmei
    Zhu, Jie
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [29] INTEGRATED GRAD-CAM: SENSITIVITY-AWARE VISUAL EXPLANATION OF DEEP CONVOLUTIONAL NETWORKS VIA INTEGRATED GRADIENT-BASED SCORING
    Sattarzadeh, Sam
    Sudhakar, Mahesh
    Plataniotis, Konstantinos N.
    Jang, Jongseong
    Jeong, Yeonjeong
    Kim, Hyunwoo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1775 - 1779
  • [30] Gradient-Based Inverse Estimation for a Rainfall-Runoff Model
    Krapu, Christopher
    Borsuk, Mark
    Kumar, Mukesh
    WATER RESOURCES RESEARCH, 2019, 55 (08) : 6625 - 6639