Privacy-Preserving Collaborative Data Collection and Analysis With Many Missing Values

Cited by: 10
Authors
Sei, Yuichi [1, 2]
Onesimu, J. Andrew [3]
Okumura, Hiroshi [4]
Ohsuga, Akihiko [1]
Affiliations
[1] Univ Electrocommun, Tokyo 1828585, Japan
[2] PRESTO, JST, Kawaguchi, Saitama 3320012, Japan
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Comp Sci & Engn, Manipal 576104, India
[4] Mitsubishi Res Inst, Tokyo 1008141, Japan
Keywords
Data collection; Servers; Differential privacy; Data models; COVID-19; Privacy; Hospitals; differential privacy; missing values; multi-dimensional analysis; privacy-preserving data collection; MEMBERSHIP INFERENCE ATTACKS; VALUE IMPUTATION; COPULAS; NOISE;
DOI
10.1109/TDSC.2022.3174887
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Privacy-preserving data mining techniques are useful for analyzing various information, such as Internet of Things data and COVID-19-related patient data. However, collecting a large amount of sensitive personal information is a challenging task. In addition, this information may have missing values, which are not considered in the existing methods for collecting personal information while ensuring data privacy. Failure to account for missing values reduces the accuracy of the data analysis. In this article, we propose a method for privacy-preserving data collection that considers many missing values. The patient data are anonymized and sent to a data collection server. The data collection server creates a generative model and a contingency table suitable for multi-attribute analysis based on expectation-maximization and Gaussian copula methods. Using differential privacy (the de facto standard) as a privacy metric, we conduct experiments on synthetic and real data, including COVID-19-related data. The results are 50-80% more accurate than those of existing methods that do not consider missing values.
Pages: 2158-2173
Page count: 16
Related papers
50 in total
  • [41] A Privacy-Preserving Federated Learning for Multiparty Data Sharing in Social IoTs
    Yin, Lihua
    Feng, Jiyuan
    Xun, Hao
    Sun, Zhe
    Cheng, Xiaochun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (03): 2706 - 2718
  • [42] Local Information Privacy and Its Application to Privacy-Preserving Data Aggregation
    Jiang, Bo
    Li, Ming
    Tandon, Ravi
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (03) : 1918 - 1935
  • [43] Adapting Geo-Indistinguishability for Privacy-Preserving Collection of Medical Microdata
    Song, Seungmin
    Kim, Jongwook
    ELECTRONICS, 2023, 12 (13)
  • [44] Privacy-preserving data publishing for cluster analysis
    Fung, Benjamin C. M.
    Wang, Ke
    Wang, Lingyu
    Hung, Patrick C. K.
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (06) : 552 - 575
  • [45] Privacy-Preserving Collaborative Recommender Systems
    Zhan, Justin
    Hsieh, Chia-Lung
    Wang, I-Cheng
    Hsu, Tsan-Sheng
    Liau, Churn-Jung
    Wang, Da-Wei
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2010, 40 (04): 472 - 476
  • [46] Privacy-Preserving Correlated Data Publication: Privacy Analysis and Optimal Noise Design
    Sun, Mingjing
    Zhao, Chengcheng
    He, Jianping
    Cheng, Peng
    Quevedo, Daniel E.
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (03): 2014 - 2024
  • [47] Privacy-Preserving Data Mining: Methods, Metrics, and Applications
    Mendes, Ricardo
    Vilela, Joao P.
    IEEE ACCESS, 2017, 5 : 10562 - 10582
  • [48] Incentive Compatible Privacy-Preserving Data Analysis
    Kantarcioglu, Murat
    Jiang, Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (06) : 1323 - 1335
  • [49] Privacy-Preserving Collaborative Learning Through Feature Extraction
    Sarmadi, Alireza
    Fu, Hao
    Krishnamurthy, Prashanth
    Garg, Siddharth
    Khorrami, Farshad
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (01) : 486 - 498
  • [50] Improving Data Utility in Privacy-Preserving Location Data Collection via Adaptive Grid Partitioning
    Kim, Jongwook
    ELECTRONICS, 2024, 13 (15)