TiFLCS-MARP: Client selection and model pricing for federated learning in data markets

被引:4
作者
Sun, Yongjiao [1 ]
Li, Boyang [2 ,5 ,6 ]
Yang, Kai [1 ]
Bi, Xin [3 ]
Zhao, Xiangning [4 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[3] Northeastern Univ, Key Lab Minist Educ Safe Min Deep Met Mines, Shenyang 110819, Peoples R China
[4] Tiangong Univ, Tianjin 300387, Peoples R China
[5] BIT, Tangshan Res Inst, Tangshan 063000, Peoples R China
[6] Hebei Prov Key Lab Big Data Sci & Intelligent Tech, Tangshan 300401, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Federated learning; Deep learning; Client selection; Game theory; Nash equilibrium; NETWORKS; DESIGN;
D O I
10.1016/j.eswa.2023.123071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the burgeoning data market has witnessed a surge in data exchange, playing a pivotal role in augmenting the predictive and decision -making capabilities of machine learning. Despite these advancements, persistent concerns surrounding data privacy have resulted in stringent limitations on data sharing and trading. Consequently, the data market is undergoing a transformative shift from pricing individual datasets to pricing the models themselves. The challenge of training high-performance machine learning models with restricted data from a single client is substantial. Federated learning has emerged as a popular solution, allowing collaborative model training without the need to transfer client data beyond local environments. However, training federated models within a data market introduces several challenges, including the effective selection of clients for model training and the optimization of utility through model pricing. In response to these challenges, we propose the Tiered Federated Learning Client Selection Algorithm (TiFLCS-MAR), employing a multi -attribute reverse auction approach. Integrated into the federated learning framework, TiFLCS-MAR excels at the comprehensive evaluation of client attributes, employing a tiered strategy to mitigate issues arising from client heterogeneity. Additionally, we introduce the TiFLCS-MAR Pricing Framework (TiFLCSMARP), leveraging Nash equilibrium principles to maximize profitability for both clients and servers. Our framework accommodates the heterogeneity of diverse clients, efficiently selecting suitable candidates from a large pool, thereby boosting training efficiency and curbing model pricing costs. Empirical evidence showcases the efficacy of federated training with TiFLCS-MAR, demonstrating nearly double the convergence speed and a 5-10 percentage point improvement in accuracy across real and synthetic datasets. Furthermore, when compared to three baseline algorithms, TiFLCS-MARP substantially increases central server revenue by factors of 1.99, 27.05, and 1.78, highlighting its superior performance in the data market context.
引用
收藏
页数:13
相关论文
共 27 条
[1]   DESIGN COMPETITION THROUGH MULTIDIMENSIONAL AUCTIONS [J].
CHE, YK .
RAND JOURNAL OF ECONOMICS, 1993, 24 (04) :668-680
[2]   Towards Model-based Pricing for Machine Learning in a Data Marketplace [J].
Chen, Lingjiao ;
Koutris, Paraschos ;
Kumar, Arun .
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, :1535-1552
[3]  
Cho Y. J., 2020, Preprints
[4]  
Katharopoulos A, 2018, PR MACH LEARN RES, V80
[5]  
Koutris P., 2013, SIGMOD, P613, DOI DOI 10.1145/2463676.2465335
[6]   Query-Based Data Pricing [J].
Koutris, Paraschos ;
Upadhyaya, Prasang ;
Balazinska, Magdalena ;
Howe, Bill ;
Suciu, Dan .
JOURNAL OF THE ACM, 2015, 62 (05)
[7]  
Li T., 2020, PROC MACH LEARN SYST, V2, P429, DOI DOI 10.48550/ARXIV.1812.06127
[8]   Federated Learning: Challenges, Methods, and Future Directions [J].
Li, Tian ;
Sahu, Anit Kumar ;
Talwalkar, Ameet ;
Smith, Virginia .
IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (03) :50-60
[9]  
McMahan HB, 2017, PR MACH LEARN RES, V54, P1273
[10]  
Nget R., 2017, P SIGIR 2017 WORKSHO