PDFed: Privacy-Preserving and Decentralized Asynchronous Federated Learning for Diffusion Models

被引:0
作者
Balan, Kar Gabriel [1 ]
Gilbert, Andrew [1 ]
Collomosse, John [2 ]
机构
[1] Univ Surrey, Surrey, England
[2] Adobe Res, San Francisco, CA USA
来源
PROCEEDINGS OF THE 21ST ACM SIGGRAPH CONFERENCE ON VISUAL MEDIA PRODUCTION, CVMP 2024 | 2024年
基金
英国工程与自然科学研究理事会;
关键词
Federated Learning; Diffusion Models; Blockchain; Data Privacy; Decentralized AI; Privacy-preserving Machine Learning;
D O I
10.1145/3697294.3697306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present PDFed, a decentralized, aggregator-free, and asynchronous federated learning protocol for training image diffusion models using a public blockchain. In general, diffusion models are prone to memorization of training data, raising privacy and ethical concerns (e.g., regurgitation of private training data in generated images). Federated learning (FL) offers a partial solution via collaborative model training across distributed nodes that safeguard local data privacy. PDFed proposes a novel sample-based score that measures the novelty and quality of generated samples, incorporating these into a blockchain-based federated learning protocol that we show reduces private data memorization in the collaboratively trained model. In addition, PDFed enables asynchronous collaboration among participants with varying hardware capabilities, facilitating broader participation. The protocol records the provenance of AI models, improving transparency and auditability, while also considering automated incentive and reward mechanisms for participants. PDFed aims to empower artists and creators by protecting the privacy of creative works and enabling decentralized, peer-to-peer collaboration. The protocol positively impacts the creative economy by opening up novel revenue streams and fostering innovative ways for artists to benefit from their contributions to the AI space.
引用
收藏
页数:9
相关论文
共 28 条
[11]  
Coalition for Content Provenance and Authenticity, 2021, Technical Report
[12]  
Hendrycks Dan, 2019, INT C LEARNING REPRE
[13]  
Ho JAT, 2020, Arxiv, DOI [arXiv:2006.11239, 10.48550/arxiv.2006.11239.arXiv, DOI 10.48550/ARXIV.2006.11239]
[14]  
Jiralerspong M, 2023, ADV NEUR IN
[15]  
Krizhevsky A., 2009, Technical report
[16]   ON A TEST OF WHETHER ONE OF 2 RANDOM VARIABLES IS STOCHASTICALLY LARGER THAN THE OTHER [J].
MANN, HB ;
WHITNEY, DR .
ANNALS OF MATHEMATICAL STATISTICS, 1947, 18 (01) :50-60
[17]  
Meehan C, 2020, PR MACH LEARN RES, V108, P3546
[18]  
Mugunthan V, 2020, Arxiv, DOI arXiv:2007.03856
[19]  
Oquab M., 2024, Transactions on Machine Learning Research
[20]   BAFFLE : Blockchain Based Aggregator Free Federated Learning [J].
Ramanan, Paritosh ;
Nakayama, Kiyoshi .
2020 IEEE INTERNATIONAL CONFERENCE ON BLOCKCHAIN (BLOCKCHAIN 2020), 2020, :72-81