When Homomorphic Encryption Marries Secret Sharing: Secure Large-Scale Sparse Logistic Regression and Applications in Risk Control

Cited by: 43
Authors
Chen, Chaochao [1]
Zhou, Jun [1]
Wang, Li [1]
Wu, Xibin [1]
Fang, Wenjing [1]
Tan, Jin [1]
Wang, Lei [1]
Liu, Alex X. [1]
Wang, Hao [2]
Hong, Cheng [3]
Affiliations
[1] Ant Grp, Hangzhou, Peoples R China
[2] Shandong Normal Univ, Jinan, Shandong, Peoples R China
[3] Alibaba Grp, Hangzhou, Peoples R China
Source
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021
Keywords
Homomorphic encryption; secret sharing; multi-party computation; large-scale; logistic regression
DOI
10.1145/3447548.3467210
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Logistic Regression (LR) is the most widely used machine learning model in industry for its efficiency, robustness, and interpretability. Due to the problem of data isolation and the requirement of high model performance, many applications in industry call for building a secure and efficient LR model for multiple parties. Most existing work uses either Homomorphic Encryption (HE) or Secret Sharing (SS) to build secure LR. HE-based methods can deal with high-dimensional sparse features, but they incur potential security risks. SS-based methods have provable security, but they have efficiency issues under high-dimensional sparse features. In this paper, we first present CAESAR, which combines HE and SS to build a secure large-scale sparse logistic regression model and achieves both efficiency and security. We then present the distributed implementation of CAESAR to meet the scalability requirement. We have deployed CAESAR in a risk control task and conducted comprehensive experiments. Our experimental results show that CAESAR improves the state-of-the-art model by around 130 times.
Pages: 2652-2662
Number of pages: 11
Related papers
3 items in total
  • [1] Large-Scale Sparse Logistic Regression
    Liu, Jun
    Chen, Jianhui
    Ye, Jieping
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009: 547 - 555
  • [2] Secure large-scale genome-wide association studies using homomorphic encryption
    Blatt, Marcelo
    Gusev, Alexander
    Polyakov, Yuriy
    Goldwasser, Shafi
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (21) : 11608 - 11613
  • [3] A sparse version of the ridge logistic regression for large-scale text categorization
    Aseervatham, Sujeevan
    Antoniadis, Anestis
    Gaussier, Eric
    Burlet, Michel
    Denneulin, Yves
    PATTERN RECOGNITION LETTERS, 2011, 32 (02) : 101 - 106