AFed: Algorithmic Fair Federated Learning

Cited by: 0
Authors
Chen, Huiqiang [1]
Zhu, Tianqing [1]
Zhou, Wanlei [1]
Zhao, Wei [2]
Affiliations
[1] City Univ Macau, Fac Data Sci, Macau, Peoples R China
[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
Keywords
Training; Servers; Data models; Generators; Machine learning algorithms; Feature extraction; Classification algorithms; Generative adversarial networks; Federated learning; Accuracy; Fair machine learning; federated learning (FL); generative model
DOI
10.1109/TNNLS.2025.3528012
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Federated learning (FL) has gained significant attention as it facilitates collaborative machine learning among multiple clients without centralizing their data on a server. FL ensures the privacy of participating clients by locally storing their data, which creates new challenges in fairness. Traditional debiasing methods assume centralized access to sensitive information, rendering them impractical for the FL setting. Additionally, FL is more susceptible to fairness issues than centralized machine learning due to the diverse client data sources that may be associated with group information. Therefore, training a fair model in FL without access to client local data is important and challenging. This article presents AFed, a straightforward, yet effective framework for promoting group fairness in FL. The core idea is to circumvent restricted data access by learning the global data distribution. This article proposes two approaches: AFed-G, which uses a conditional generator trained on the server side, and AFed-GAN, which improves upon AFed-G by training a conditional GAN on the client side. We augment the client data with the generated samples to help remove bias. Our theoretical analysis justifies the proposed methods, and empirical results on multiple real-world datasets demonstrate a substantial improvement in AFed over several baselines.
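The abstract's idea of augmenting client data with generated samples to reduce group bias can be illustrated with a minimal sketch. It is not the AFed implementation: the function names, the choice of demographic parity as the group-fairness measure, and the rule of balancing every (label, group) cell are all assumptions made for illustration only.

```python
from collections import Counter


def demographic_parity_gap(predictions, groups):
    """Largest difference in positive-prediction rate between any two groups.

    Assumption: binary predictions (1 = positive) and one group id per sample.
    """
    totals = Counter(groups)
    positives = Counter(g for p, g in zip(predictions, groups) if p == 1)
    rates = [positives[g] / totals[g] for g in totals]
    return max(rates) - min(rates)


def samples_needed_per_cell(labels, groups):
    """Hypothetical balancing rule: how many generated samples each
    (label, group) cell would need to match the largest cell on a client."""
    counts = Counter(zip(labels, groups))
    target = max(counts.values())
    return {
        (y, a): target - counts.get((y, a), 0)
        for y in sorted(set(labels))
        for a in sorted(set(groups))
    }


if __name__ == "__main__":
    labels = [1, 1, 1, 0]
    groups = ["a", "a", "b", "a"]
    # Number of synthetic samples a conditional generator would be asked for.
    print(samples_needed_per_cell(labels, groups))
    # Fairness gap of a classifier that predicts the label for every sample.
    print(demographic_parity_gap(labels, groups))
```

In this sketch, a client would request the listed number of samples for each cell from a conditional generator and append them to its local data before training.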
Pages: 13