A Privacy-Preserving Framework for Collaborative Machine Learning with Kernel methods

被引：0

作者：

Hannemann, Anika ^{[1
]}

Uenal, Ali Burak ^{[2
,3
]}

Swaminathan, Arjhun ^{[2
,3
]}

Buchmann, Erik ^{[1
]}

Akguen, Mete ^{[2
,3
]}

机构：

[1] Univ Leipzig, Ctr Scalable Data Analyt & Artificial Intelligenc, Dept Comp Sci, Dresden, Germany

[2] Univ Tubingen, Med Data Privacy & Privacy Preserving Machine Lea, Tubingen, Germany

[3] Univ Tubingen, Inst Bioinformat & Med Informat IBMI, Tubingen, Germany

来源：

2023 5TH IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS, TPS-ISA | 2023年

关键词：

Privacy; Kernel-based Machine Learning; Distributed Learning;

D O I：

10.1109/TPS-ISA58951.2023.00020

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is challenging to implement Kernel methods, if the data sources are distributed and cannot be joined at a trusted third party for privacy reasons. It is even more challenging, if the use case rules out privacy-preserving approaches that introduce noise or entail significant computational overhead. An example for such a use case is machine learning on clinical data. To realize exact and efficient privacy preserving computation of kernel methods, we propose FLAKE, a Framework for Learning with Anonymized KErnels on horizontally distributed data. With our method, the data sources mask their data so that a Gram matrix can be computed without compromising privacy or utility. The Gram matrix allows to calculate many kernel matrices, which can be used to train kernel-based machine learning algorithms such as Support Vector Machines. We prove that our framework prevents an adversary from learning the input data or the number of input features under a semi-honest threat model. The conducted experiments on clinical, genomic, and image data provide confirmation that our approach is applicable across a wide range of settings. Additionally, our method outperforms comparable approaches in both computational efficiency and accuracy. Thus, FLAKE is a lightweight, applicable approach suitable for various use cases.

引用

页码：82 / 90

页数：9

共 30 条

[1] Federated learning and differential privacy for medical image analysis [J].

Adnan, Mohammed ;

Kalra, Shivam ;

Cresswell, Jesse C. ;

Taylor, Graham W. ;

Tizhoosh, Hamid R. .

SCIENTIFIC REPORTS, 2022, 12 (01)

[2]

Aguilera A., 2004, "General n -dimensional rotations

[3]

[Anonymous], 2006, P 2006 ACM S APPL CO

[4] Practical Secure Aggregation for Privacy-Preserving Machine Learning [J].

Bonawitz, Keith ;

Ivanov, Vladimir ;

Kreuter, Ben ;

Marcedone, Antonio ;

McMahan, H. Brendan ;

Patel, Sarvar ;

Ramage, Daniel ;

Segal, Aaron ;

Seth, Karn .

CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1175-1191

[5]

Chen KK, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P78

[6]

Chen KK, 2005, Fifth IEEE International Conference on Data Mining, Proceedings, P589

[7] Least angle regression - Rejoinder [J].

Efron, B ;

Hastie, T ;

Johnstone, I ;

Tibshirani, R .

ANNALS OF STATISTICS, 2004, 32 (02) :494-499

[8]

European Commission, 2016, Official Journal of the European Union

[9]

Evans D., 2018, Foundations and Trends in Privacy and Security, V2, P70

[10]

github, Shenggan: BCCD dataset

← 1 2 3 →