Privacy-preserving imputation of missing data

Cited by: 17
Authors
Jagannathan, Geetha [1]
Wright, Rebecca N. [1]
Affiliations
[1] Stevens Inst Technol, Dept Comp Sci, Hoboken, NJ 07030 USA
Funding
U.S. National Science Foundation
Keywords
data cleaning; data imputation; privacy-preserving protocols
DOI
10.1016/j.datak.2007.06.013
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Handling missing data is a critical step to ensuring good results in data mining. Like most data mining algorithms, existing privacy-preserving data mining algorithms assume data is complete. In order to maintain privacy in the data mining process while cleaning data, privacy-preserving methods of data cleaning are required. In this paper, we address the problem of privacy-preserving data imputation of missing data. We present a privacy-preserving protocol for filling in missing values using a lazy decision-tree imputation algorithm for data that is horizontally partitioned between two parties. The participants of the protocol learn only the imputed values. The computed decision tree is not learned by either party. (c) 2007 Elsevier B.V. All rights reserved.
Pages: 40-56
Page count: 17