Cross-Silo Federated Learning-to-Rank

被引:0
作者
Shi D.-Y. [1 ,2 ,3 ]
Wang Y.-S. [1 ,2 ,3 ]
Zheng P.-F. [1 ,2 ,3 ]
Tong Y.-X. [1 ,2 ,3 ]
机构
[1] State Key Laboratory of Software Development Enviroment, Beihang University, Beijing
[2] Beijing Advanced Innovation Center for Big Data and Brain Computing, Beihang University, Beijing
[3] School of Computer Science and Engineering, Beihang University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2021年 / 32卷 / 03期
基金
中国国家自然科学基金;
关键词
Data silo; Differential privacy; Federated learning; Learning-to-rank; Sketch;
D O I
10.13328/j.cnki.jos.006174
中图分类号
学科分类号
摘要
Learning-to-rank (LTR) model has made a remarkable achievement. However, traditional training scheme for LTR model requires large amount of text data. Considering the increasing concerns about privacy protection, it is becoming infeasible to collect text data from multiple data owners as before, and thus data is forced to save separately. The separation turns data owners into data silos, among which the data can hardly exchange, causing LTR training severely compromised. Inspired by the recent progress in federated learning, a novel framework is proposed named cross-silo federated learning-to-rank (CS-F-LTR), which addresses two unique challenges faced by LTR when applied it to federated scenario. In order to deal with the cross-party feature generation problem, CS-F-LTR utilizes a sketch and differential privacy based method, which is much more efficient than encryption-based protocols meanwhile the accuracy loss is still guaranteed. To tackle with the missing label problem, CS-F-LTR relies on a semi-supervised learning mechanism that facilitates fast labeling with mutual labelers. Extensive experiments conducted on public datasets verify the effectiveness of the proposed framework. © Copyright 2021, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:669 / 688
页数:19
相关论文
共 52 条
[1]  
Burges CJC, Shaked T, Renshaw E, Et al., Learning to rank using gradient descent, Proc. of the 22th Int'l Conf. on Machine Learning, pp. 89-96, (2005)
[2]  
Chapelle O, Chang Y., Yahoo! Learning to rank challenge overview, Proc. of the 28th Int'l Conf. on Machine Learning, pp. 1-24, (2011)
[3]  
Liu TY., Learning to Rank for Information Retrieval, (2011)
[4]  
Yang Q, Liu Y, Chen TJ, Et al., Federated machine learning: Concept and applications, ACM Trans. on Intelligent Systems and Technology, 10, 2, (2019)
[5]  
Chang EY, Zhu KH, Wang H, Et al., Parallelizing support vector machines on distributed computers, Proc. of the 21st Neural Information Processing Systems, pp. 257-264, (2007)
[6]  
Yang Q, Liu Y, Chen TJ, Et al., Federated learning, Communications of the CCF, 14, 11, pp. 49-55, (2018)
[7]  
Yin DW, Hu YN, Tang JL, Et al., Ranking relevance in Yahoo search, Proc. of the 22nd ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, pp. 323-332, (2016)
[8]  
Li C, Shen DR, Kou Y, Et al., Diversity-aware kNN query processing approaches for temporal spatial textual content, Pattern Recognition and Artificial Intelligence, 30, 1, pp. 64-72, (2017)
[9]  
Hou YX, Duan L, Li L, Et al., Search of genes with similar phenotype based on disease information network, Ruan Jian Xue Bao/Journal of Software, 29, 3, pp. 721-733, (2018)
[10]  
Barbaro M, Zeller T., A face is exposed for AOLsearcherno, (2006)