Logistic regression model training based on the approximate homomorphic encryption

Cited by: 116
Authors
Kim, Andrey [1 ]
Song, Yongsoo [2 ]
Kim, Miran [3 ]
Lee, Keewoo [1 ]
Cheon, Jung Hee [1 ]
Affiliations
[1] Seoul Natl Univ, Dept Math Sci, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Univ Calif San Diego, Dept Comp Sci & Engn, 9500 Gilman Dr, San Diego, CA 92093 USA
[3] Univ Calif San Diego, Div Biomed Informat, 9500 Gilman Dr, San Diego, CA 92093 USA
Funding
National Research Foundation, Singapore;
Keywords
Homomorphic encryption; Machine learning; Logistic regression;
DOI
10.1186/s12920-018-0401-7
Chinese Library Classification (CLC) number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Background: Security concerns have grown since big data became a prominent tool in data analysis. For instance, many machine learning algorithms aim to generate prediction models from training data that contain sensitive information about individuals. The cryptography community is considering secure computation as a solution for privacy protection. In particular, practical requirements have triggered research on the efficiency of cryptographic primitives.
Methods: This paper presents a method to train a logistic regression model without information leakage. We apply the homomorphic encryption scheme of Cheon et al. (ASIACRYPT 2017) for efficient arithmetic over real numbers, and devise a new encoding method to reduce the storage of the encrypted database. In addition, we adapt Nesterov's accelerated gradient method to reduce the number of iterations as well as the computational cost while maintaining the quality of the output classifier.
Results: Our method shows state-of-the-art performance of a homomorphic encryption system in a real-world application. The submission based on this work was selected as the best solution for Track 3 of the iDASH Privacy and Security Competition 2017. For example, it took about six minutes to obtain a logistic regression model from a dataset of 1579 samples, each of which has 18 features and a binary outcome variable.
Conclusions: We present a practical solution for outsourcing analysis tools such as logistic regression analysis while preserving data confidentiality.
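As a rough illustration of the optimization step named in the abstract, the following is a minimal plaintext sketch of logistic regression trained with Nesterov's accelerated gradient. It is not the authors' encrypted implementation: the function names, learning rate, momentum value, and iteration count are assumptions chosen for illustration, and the exact sigmoid is used here, whereas an encrypted version would need a polynomial approximation because homomorphic schemes of this kind evaluate only additions and multiplications.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_logreg_nesterov(X, y, iterations=50, lr=1.0, momentum=0.9):
    """Fit weights (last entry is the bias) with Nesterov momentum.

    All hyperparameter defaults are illustrative assumptions,
    not values taken from the paper.
    """
    n, d = len(X), len(X[0])
    w = [0.0] * (d + 1)  # current weights, bias stored last
    v = [0.0] * (d + 1)  # velocity (momentum term)
    for _ in range(iterations):
        # Evaluate the gradient at the look-ahead point w + momentum * v.
        look = [wi + momentum * vi for wi, vi in zip(w, v)]
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            z = sum(lj * xj for lj, xj in zip(look, xi)) + look[d]
            err = sigmoid(z) - yi
            for j in range(d):
                grad[j] += err * xi[j]
            grad[d] += err
        # Update the velocity with the averaged gradient, then the weights.
        v = [momentum * vi - lr * g / n for vi, g in zip(v, grad)]
        w = [wi + vi for wi, vi in zip(w, v)]
    return w


def predict(w, x):
    """Return the predicted class (0 or 1) for one sample."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + w[-1]
    return 1 if sigmoid(z) >= 0.5 else 0
```

Computing the gradient at the look-ahead point rather than at the current weights is what distinguishes Nesterov's method from plain momentum; fewer iterations matter in the encrypted setting because each iteration consumes costly homomorphic multiplications.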
Pages: 9
Related papers
50 in total
  • [21] Multiple Linear Regression Based on Stream Homomorphic Encryption Computing
    Zhang, Yi-Zhuo
    Liu, Yiwei
    Chung, Chan-Liang
    Chen, Chi-Hua
    Hwang, Feng-Jang
    2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 533 - 536
  • [22] Differential Privacy for Free? Harnessing the Noise in Approximate Homomorphic Encryption
    Ogilvie, Tabitha
    TOPICS IN CRYPTOLOGY, CT-RSA 2024, 2024, 14643 : 292 - 315
  • [23] A Bitwise Logistic Regression Using Binary Approximation and Real Number Division in Homomorphic Encryption Scheme
    Yoo, Joon Soo
    Hwang, Jeong Hwan
    Song, Baek Kyung
    Yoon, Ji Won
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2019, 2019, 11879 : 20 - 40
  • [24] Privacy-preserving approximate GWAS computation based on homomorphic encryption
    Duhyeong Kim
    Yongha Son
    Dongwoo Kim
    Andrey Kim
    Seungwan Hong
    Jung Hee Cheon
    BMC Medical Genomics, 13
  • [25] Rubato: Noisy Ciphers for Approximate Homomorphic Encryption
    Ha, Jincheol
    Kim, Seongkwang
    Lee, Byeonghak
    Lee, Jooyoung
    Son, Mincheol
    ADVANCES IN CRYPTOLOGY - EUROCRYPT 2022, PT I, 2022, 13275 : 581 - 610
  • [26] Privacy preservation for machine learning training and classification based on homomorphic encryption schemes
    Li, Jing
    Kuang, Xiaohui
    Lin, Shujie
    Ma, Xu
    Tang, Yi
    INFORMATION SCIENCES, 2020, 526 : 166 - 179
  • [27] From accuracy to approximation: A survey on approximate homomorphic encryption and its applications
    Liu, Weinan
    You, Lin
    Shao, Yunfei
    Shen, Xinyi
    Hu, Gengran
    Shi, Jiawen
    Gao, Shuhong
    COMPUTER SCIENCE REVIEW, 2025, 55
  • [28] When Homomorphic Encryption Marries Secret Sharing: Secure Large-Scale Sparse Logistic Regression and Applications in Risk Control
    Chen, Chaochao
    Zhou, Jun
    Wang, Li
    Wu, Xibin
    Fang, Wenjing
    Tan, Jin
    Wang, Lei
    Liu, Alex X.
    Wang, Hao
    Hong, Cheng
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2652 - 2662
  • [29] Exploring the future of privacy-preserving heart disease prediction: a fully homomorphic encryption-driven logistic regression approach
    Naresh, Vankamamidi S.
    Reddi, Sivaranjani
    JOURNAL OF BIG DATA, 2025, 12 (01)
  • [30] Approximate Methods for the Computation of Step Functions in Homomorphic Encryption
    Huang, Tairong
    Ma, Shihe
    Wang, Anyu
    Wang, Xiaoyun
    INFORMATION SECURITY AND PRIVACY, PT I, ACISP 2024, 2024, 14895 : 217 - 237