Model Debiasing via Gradient-based Explanation on Representation

Authors
Zhang, Jindi [1]
Wang, Luning [1]
Su, Dan [3]
Huang, Yongxiang [1]
Cao, Caleb Chen [2]
Chen, Lei [2]
Affiliations
[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci, Hong Kong, Peoples R China
[3] NVIDIA Res, Hong Kong, Peoples R China
Source
PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023 | 2023
Keywords
fairness; model debiasing; representation learning; gradient-based explanation
DOI
10.1145/3600211.3604668
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is, first, to leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, to use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our framework works with both disentangled and non-disentangled representation learning methods and achieves a better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.
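The two-focus idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the logistic heads, the |gradient × input| attribution, the `perturb` rule, and all weights and values below are assumptions chosen only to show how gradient-based focuses on a latent code might be computed and then used to attenuate dimensions that matter more for the sensitive attribute than for the task.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def head_gradient(weights, bias, z):
    """Gradient of a logistic head's output probability w.r.t. the latent code z."""
    logit = sum(w * zi for w, zi in zip(weights, z)) + bias
    p = sigmoid(logit)
    # d sigmoid(w.z + b) / dz = p * (1 - p) * w
    return [p * (1.0 - p) * w for w in weights]

def focus(weights, bias, z):
    """Normalized |gradient * input| score per latent dimension (assumed attribution)."""
    grads = head_gradient(weights, bias, z)
    scores = [abs(g * zi) for g, zi in zip(grads, z)]
    total = sum(scores)
    return [s / total for s in scores] if total > 0 else [0.0] * len(z)

def perturb(z, sensitive_focus, task_focus, strength=1.0):
    """Hypothetical rule: shrink dimensions the sensitive head relies on
    more than the task head; leave the others unchanged."""
    out = []
    for zi, fs, ft in zip(z, sensitive_focus, task_focus):
        excess = max(0.0, fs - ft)
        out.append(zi * (1.0 - min(1.0, strength * excess)))
    return out

# Toy latent code and two hypothetical heads (values are illustrative only).
z = [1.0, -2.0, 0.5]
sens_w, sens_b = [0.1, 3.0, 0.0], 0.0   # sensitive-attribute head
task_w, task_b = [2.0, 0.1, 1.0], 0.0   # downstream-task head

fs = focus(sens_w, sens_b, z)
ft = focus(task_w, task_b, z)
z_fair = perturb(z, fs, ft)
print(z_fair)
```

In this toy setup the second latent dimension dominates the sensitive head's focus, so it is the only one attenuated; the perturbed code would then be passed to the downstream model during training.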
Pages: 193-204
Number of pages: 12