Model Debiasing via Gradient-based Explanation on Representation

Cited by: 0

Authors:

Zhang, Jindi [1]

Wang, Luning [1]

Su, Dan [3]

Huang, Yongxiang [1]

Cao, Caleb Chen [2]

Chen, Lei [2]

Affiliations:

[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China

[2] Hong Kong Univ Sci, Hong Kong, Peoples R China

[3] NVIDIA Res, Hong Kong, Peoples R China

Source:

PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023 | 2023

Keywords:

fairness; model debiasing; representation learning; gradient-based explanation

DOI:

10.1145/3600211.3604668

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is, first, to leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, to use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our framework works with both disentangled and non-disentangled representation learning methods and achieves a better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.
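The two-step idea in the abstract (find two gradient-based focuses, then perturb the latent code) can be illustrated with a toy sketch. This is not the authors' implementation: the logistic heads, the normalised absolute-gradient `focus`, the `perturb` damping rule, the `strength` parameter, and all numeric values are assumptions chosen only to make the idea concrete.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def input_gradient(weights, bias, z):
    """Gradient of a logistic head's output probability w.r.t. the latent code z."""
    p = sigmoid(sum(w * v for w, v in zip(weights, z)) + bias)
    return [p * (1.0 - p) * w for w in weights]


def focus(weights, bias, z):
    """Normalised absolute gradient: a simple saliency-style focus over latent dims."""
    grads = [abs(g) for g in input_gradient(weights, bias, z)]
    total = sum(grads)
    return [g / total for g in grads] if total > 0 else [0.0] * len(grads)


def perturb(z, sensitive_focus, task_focus, strength=1.0):
    """Assumed rule: damp a dimension by how much more the sensitive head
    relies on it than the task head; leave task-dominated dimensions intact."""
    out = []
    for v, s, t in zip(z, sensitive_focus, task_focus):
        damp = min(1.0, max(0.0, strength * (s - t)))
        out.append(v * (1.0 - damp))
    return out


# Hypothetical 4-dimensional latent code and hand-set head weights.
z = [0.8, -0.5, 1.2, 0.3]
sensitive_w, sensitive_b = [2.0, 0.1, 0.0, 0.1], 0.0  # sensitive-attribute head
task_w, task_b = [0.1, 1.5, 1.0, 0.0], 0.0            # downstream-task head

s_focus = focus(sensitive_w, sensitive_b, z)
t_focus = focus(task_w, task_b, z)
z_fair = perturb(z, s_focus, t_focus)
```

In this toy setting the first dimension, which the sensitive head relies on most, is shrunk, while the two dimensions that matter mainly to the task head pass through unchanged. In the framework described above the perturbed code is then used to guide the training of the downstream model; that training step is not shown here.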


Pages: 193-204

Page count: 12
