Model Debiasing via Gradient-based Explanation on Representation

被引:0
|
作者
Zhang, Jindi [1 ]
Wang, Luning [1 ]
Su, Dan [3 ]
Huang, Yongxiang [1 ]
Cao, Caleb Chen [2 ]
Chen, Lei [2 ]
机构
[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci, Hong Kong, Peoples R China
[3] NVIDIA Res, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023 | 2023年
关键词
fairness; model debiasing; representation learning; gradient-based explanation;
D O I
10.1145/3600211.3604668
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially for unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is to, first, leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our frameworkworks with both disentangled and non-disentangled representation learning methods and achieves better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [41] Latent Representation Learning Model for Multi-Band Images Fusion via Low-Rank and Sparse Embedding
    Wang, Bin
    Niu, Huifang
    Zeng, Jianchao
    Bai, Guifeng
    Lin, Suzhen
    Wang, Yanbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3137 - 3152
  • [42] CARM: Confidence-aware recommender model via review representation learning and historical rating behavior in the online platforms
    Li, Duantengchuan
    Liu, Hai
    Zhang, Zhaoli
    Lin, Ke
    Fang, Shuai
    Li, Zhifei
    Xiong, Neal N.
    NEUROCOMPUTING, 2021, 455 : 283 - 296
  • [43] Recommendation Model Based on Polarization Relation Representation and Low-Dimensional Data Association Learning
    Cai X.
    Hong T.
    Cao Y.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (01): : 122 - 131
  • [44] Knowledge graph representation learning model based on meta-information and logical rule enhancements
    Wang, Ling
    Lu, Jicang
    Sun, Yepeng
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (04) : 112 - 125
  • [45] MCKRL: A Multi-channel Based Multi-graph Knowledge Representation Learning Model
    Tang, Zihao
    Zhang, Xiang
    Shang, Xiaoyu
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT I, NLPCC 2024, 2025, 15359 : 504 - 516
  • [46] A heterogeneous E-commerce user alignment model based on data enhancement and data representation
    Wei, Shihong
    Zhou, Xinming
    An, Xubin
    Yang, Xu
    Xiao, Yunpeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [47] Large Language Model-Based Representation Learning for Entity Resolution using Contrastive Learning
    Foua, Bi T.
    Talburt, John R.
    Xu, Xiaowei
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 15 - 22
  • [48] A representation-learning-based approach to predict stock price trend via dynamic spatiotemporal feature embedding
    Pang, Bowen
    Wei, Wei
    Li, Xing
    Feng, Xiangnan
    Li, Chao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [49] Semi-Supervised Representation Learning via Triplet Loss Based on Explicit Class Ratio of Unlabeled Data
    Murasaki, Kazuhiko
    Ando, Shingo
    Shimamura, Jun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (04) : 778 - 784
  • [50] EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision
    Qu, Qiang
    Chen, Xiaoming
    Chung, Yuk Ying
    Shen, Yiran
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6579 - 6591