A Privacy-Preserving Framework for Collaborative Machine Learning with Kernel methods

被引:0
作者
Hannemann, Anika [1 ]
Uenal, Ali Burak [2 ,3 ]
Swaminathan, Arjhun [2 ,3 ]
Buchmann, Erik [1 ]
Akguen, Mete [2 ,3 ]
机构
[1] Univ Leipzig, Ctr Scalable Data Analyt & Artificial Intelligenc, Dept Comp Sci, Dresden, Germany
[2] Univ Tubingen, Med Data Privacy & Privacy Preserving Machine Lea, Tubingen, Germany
[3] Univ Tubingen, Inst Bioinformat & Med Informat IBMI, Tubingen, Germany
来源
2023 5TH IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS, TPS-ISA | 2023年
关键词
Privacy; Kernel-based Machine Learning; Distributed Learning;
D O I
10.1109/TPS-ISA58951.2023.00020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is challenging to implement Kernel methods, if the data sources are distributed and cannot be joined at a trusted third party for privacy reasons. It is even more challenging, if the use case rules out privacy-preserving approaches that introduce noise or entail significant computational overhead. An example for such a use case is machine learning on clinical data. To realize exact and efficient privacy preserving computation of kernel methods, we propose FLAKE, a Framework for Learning with Anonymized KErnels on horizontally distributed data. With our method, the data sources mask their data so that a Gram matrix can be computed without compromising privacy or utility. The Gram matrix allows to calculate many kernel matrices, which can be used to train kernel-based machine learning algorithms such as Support Vector Machines. We prove that our framework prevents an adversary from learning the input data or the number of input features under a semi-honest threat model. The conducted experiments on clinical, genomic, and image data provide confirmation that our approach is applicable across a wide range of settings. Additionally, our method outperforms comparable approaches in both computational efficiency and accuracy. Thus, FLAKE is a lightweight, applicable approach suitable for various use cases.
引用
收藏
页码:82 / 90
页数:9
相关论文
共 30 条
[1]   Federated learning and differential privacy for medical image analysis [J].
Adnan, Mohammed ;
Kalra, Shivam ;
Cresswell, Jesse C. ;
Taylor, Graham W. ;
Tizhoosh, Hamid R. .
SCIENTIFIC REPORTS, 2022, 12 (01)
[2]  
Aguilera A., 2004, "General n -dimensional rotations
[3]  
[Anonymous], 2006, P 2006 ACM S APPL CO
[4]   Practical Secure Aggregation for Privacy-Preserving Machine Learning [J].
Bonawitz, Keith ;
Ivanov, Vladimir ;
Kreuter, Ben ;
Marcedone, Antonio ;
McMahan, H. Brendan ;
Patel, Sarvar ;
Ramage, Daniel ;
Segal, Aaron ;
Seth, Karn .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1175-1191
[5]  
Chen KK, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P78
[6]  
Chen KK, 2005, Fifth IEEE International Conference on Data Mining, Proceedings, P589
[7]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[8]  
European Commission, 2016, Official Journal of the European Union
[9]  
Evans D., 2018, Foundations and Trends in Privacy and Security, V2, P70
[10]  
github, Shenggan: BCCD dataset