Privacy-Preserving Collaborative Data Collection and Analysis With Many Missing Values

被引:10
|
作者
Sei, Yuichi [1 ,2 ]
Onesimu, J. Andrew [3 ]
Okumura, Hiroshi [4 ]
Ohsuga, Akihiko [1 ]
机构
[1] Univ Electrocommun, Tokyo 1828585, Japan
[2] PRESTO, JST, Kawaguchi, Saitama 3320012, Japan
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Comp Sci & Engn, Manipal 576104, India
[4] Mitsubishi Res Inst, Tokyo 1008141, Japan
关键词
Data collection; Servers; Differential privacy; Data models; COVID-19; Privacy; Hospitals; differential privacy; missing values; multi-dimensional analysis; privacy-preserving data collection; MEMBERSHIP INFERENCE ATTACKS; VALUE IMPUTATION; COPULAS; NOISE;
D O I
10.1109/TDSC.2022.3174887
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy-preserving data mining techniques are useful for analyzing various information, such as Internet of Things data and COVID-19-related patient data. However, collecting a large amount of sensitive personal information is a challenging task. In addition, this information may have missing values, which are not considered in the existing methods for collecting personal information while ensuring data privacy. Failure to account for missing values reduces the accuracy of the data analysis. In this article, we propose a method for privacy-preserving data collection that considers many missing values. The patient data are anonymized and sent to a data collection server. The data collection server creates a generative model and a contingency table suitable for multi-attribute analysis based on expectation-maximization and Gaussian copula methods. Using differential privacy (the de facto standard) as a privacy metric, we conduct experiments on synthetic and real data, including COVID-19-related data. The results are 50-80% more accurate than those of existing methods that do not consider missing values.
引用
收藏
页码:2158 / 2173
页数:16
相关论文
共 50 条
  • [41] Towards Task-Free Privacy-Preserving Data Collection
    Zhibo Wang
    Wei Yuan
    Xiaoyi Pang
    Jingxin Li
    Huajie Shao
    ChinaCommunications, 2022, 19 (07) : 310 - 323
  • [42] Efficient Bilateral Privacy-Preserving Data Collection for Mobile Crowdsensing
    Wu, Axin
    Luo, Weiqi
    Yang, Anjia
    Zhang, Yinghui
    Zhu, Jianhao
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 865 - 877
  • [43] Privacy-Preserving Data Collection with Self-Awareness Protection
    Wong, Kok-Seng
    Kim, Myung Ho
    FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 365 - 371
  • [44] Privacy-Preserving Collaborative Analytics on Medical Time Series Data
    Liu, Xiaoning
    Zheng, Yifeng
    Yi, Xun
    Nepal, Surya
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (03) : 1687 - 1702
  • [45] Privacy-Preserving Multiparty Collaborative Mining with Geometric Data Perturbation
    Chen, Keke
    Liu, Ling
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (12) : 1764 - 1776
  • [46] Collaborative Fog Computing Architecture for Privacy-Preserving Data Aggregation
    Qusa, Hani
    Tarazi, Jumana
    2021 IEEE WORLD AI IOT CONGRESS (AIIOT), 2021, : 86 - 91
  • [47] Privacy-preserving hybrid collaborative filtering on cross distributed data
    Yakut, Ibrahim
    Polat, Huseyin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (02) : 405 - 433
  • [48] Collaborative classification mechanism for privacy-Preserving on horizontally partitioned data
    Zhang, Zhancheng
    Chung, Fu-Lai
    Wang, Shitong
    AUTOMATIKA, 2019, 60 (01) : 58 - 67
  • [49] Social-Aware Privacy-Preserving Correlated Data Collection
    Liao, Guocheng
    Chen, Xu
    Huang, Jianwei
    PROCEEDINGS OF THE 2018 THE NINETEENTH INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING (MOBIHOC '18), 2018, : 11 - 20
  • [50] Adaptive personalized privacy-preserving data collection scheme with local differential privacy
    Song, Haina
    Shen, Hua
    Zhao, Nan
    He, Zhangqing
    Xiong, Wei
    Wu, Minghu
    Zhang, Mingwu
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)