Privacy-Preserving Collaborative Data Collection and Analysis With Many Missing Values

被引:10
|
作者
Sei, Yuichi [1 ,2 ]
Onesimu, J. Andrew [3 ]
Okumura, Hiroshi [4 ]
Ohsuga, Akihiko [1 ]
机构
[1] Univ Electrocommun, Tokyo 1828585, Japan
[2] PRESTO, JST, Kawaguchi, Saitama 3320012, Japan
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Comp Sci & Engn, Manipal 576104, India
[4] Mitsubishi Res Inst, Tokyo 1008141, Japan
关键词
Data collection; Servers; Differential privacy; Data models; COVID-19; Privacy; Hospitals; differential privacy; missing values; multi-dimensional analysis; privacy-preserving data collection; MEMBERSHIP INFERENCE ATTACKS; VALUE IMPUTATION; COPULAS; NOISE;
D O I
10.1109/TDSC.2022.3174887
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy-preserving data mining techniques are useful for analyzing various information, such as Internet of Things data and COVID-19-related patient data. However, collecting a large amount of sensitive personal information is a challenging task. In addition, this information may have missing values, which are not considered in the existing methods for collecting personal information while ensuring data privacy. Failure to account for missing values reduces the accuracy of the data analysis. In this article, we propose a method for privacy-preserving data collection that considers many missing values. The patient data are anonymized and sent to a data collection server. The data collection server creates a generative model and a contingency table suitable for multi-attribute analysis based on expectation-maximization and Gaussian copula methods. Using differential privacy (the de facto standard) as a privacy metric, we conduct experiments on synthetic and real data, including COVID-19-related data. The results are 50-80% more accurate than those of existing methods that do not consider missing values.
引用
收藏
页码:2158 / 2173
页数:16
相关论文
共 50 条
  • [31] Privacy-preserving Multimedia Data Analysis
    Zhu, Xiaofeng
    Thung, Kim Han
    Kim, Minjeong
    COMPUTER JOURNAL, 2021, 64 (07): : 991 - 992
  • [32] Collaborative privacy-preserving analysis of oncological data using multiparty homomorphic encryption
    Geva, Ravit
    Gusev, Alexander
    Polyakov, Yuriy
    Liram, Lior
    Rosolio, Oded
    Alexandru, Andreea
    Genise, Nicholas
    Blatt, Marcelo
    Duchin, Zohar
    Waissengrin, Barliz
    Mirelman, Dan
    Bukstein, Felix
    Blumenthal, Deborah T.
    Wolf, Ido
    Pelles-Avraham, Sharon
    Schaffer, Tali
    Lavi, Lee A.
    Micciancio, Daniele
    Vaikuntanathan, Vinod
    Al Badawi, Ahmad
    Goldwasser, Shafi
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (33)
  • [33] Privacy-Preserving Data Collection for Mobile Phone Sensing Tasks
    Liu, Yi-Ning
    Wang, Yan-Ping
    Wang, Xiao-Fen
    Xia, Zhe
    Xu, Jingfang
    INFORMATION SECURITY PRACTICE AND EXPERIENCE (ISPEC 2018), 2018, 11125 : 506 - 518
  • [34] Optimal Privacy-Preserving Data Collection: A Prospect Theory Perspective
    Liao, Guocheng
    Chen, Xu
    Huang, Jianwei
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [35] Privacy-preserving hybrid collaborative filtering on cross distributed data
    Ibrahim Yakut
    Huseyin Polat
    Knowledge and Information Systems, 2012, 30 : 405 - 433
  • [36] Correlation-Aware and Personalized Privacy-Preserving Data Collection
    Yu, Dongxiao
    Zhang, Kaiyi
    Tao, Youming
    Xu, Wenlu
    Zou, Yifei
    Cheng, Xiuzhen
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 724 - 729
  • [37] An anonymization protocol for continuous and dynamic privacy-preserving data collection
    Kim, Soohyung
    Chung, Yon Dohn
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 93 : 1065 - 1073
  • [38] Towards task-free privacy-preserving data collection
    Wang, Zhibo
    Yuan, Wei
    Pang, Xiaoyi
    Li, Jingxin
    Shao, Huajie
    CHINA COMMUNICATIONS, 2022, 19 (07) : 310 - 323
  • [39] Privacy-Preserving Data Collection in Context-Aware Applications
    Li, Wei
    Hu, Chunqiang
    Song, Tianyi
    Yu, Jiguo
    Xing, Xiaoshuang
    Cai, Zhipeng
    2018 IEEE SYMPOSIUM ON PRIVACY-AWARE COMPUTING (PAC), 2018, : 75 - 85
  • [40] Privacy-Preserving Overgrid: Secure Data Collection for the Smart Grid
    Croce, Daniele
    Giuliano, Fabrizio
    Tinnirello, Ilenia
    Giarre, Laura
    SENSORS, 2020, 20 (08)