Differentially private model release for healthcare applications

被引:1
作者
Sangeetha S. [1 ]
Sudha Sadasivam G. [2 ]
Srikanth A. [1 ]
机构
[1] Department of Information Technology, PSG College of Technology, Tamil Nadu, Coimbatore
[2] Department of Computer Science and Engineering, PSG College of Technology, Tamil Nadu, Coimbatore
关键词
Differential privacy; health care; machine learning; model release; privacy preserved learning; private model;
D O I
10.1080/1206212X.2021.2024958
中图分类号
学科分类号
摘要
The evolution of technology allowed the collection of a large amount of user data, known as big data. Among all the datasets healthcare data is more sensitive. It is extremely important to protect the individual users in such datasets. Even anonymized data release is vulnerable. Hence, in this paper, we suggest a differential privacy-based model release instead of the data release. A private model release based on six machine learning classifiers namely Support Vector Machine (SVM), Random Forest algorithm, Logistic Regression, K-Nearest Neighbor, Decision Tree, and Naive Bayes are proposed. Experimental evaluation is performed using the benchmark heart disease dataset and the accuracy of the model is analyzed. The published private model can be used for the prediction of possible heart disease in patients. © 2022 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:953 / 958
页数:5
相关论文
共 29 条
  • [1] Omolara A.E., Jantan A., Abiodun O.I., Et al., Honeydetails: a prototype for ensuring patient's information privacy and thwarting electronic health record threats based on decoys, Health Inform J
  • [2] Omotosho A., Emuoyibofarhe J., Meinel C.
  • [3] Omotosho A., Emuoyibofarhe J., Oke A., Securing private keys in electronic health records using session-based hierarchical key encryption, J Appl Secur Res, 12, 4, pp. 463-477, (2017)
  • [4] Sweeney L., K-anonymity: a model for protecting privacy, Int J Uncertain Fuzziness Knowledge-Based Syst, 10, 5, pp. 557-570, (2002)
  • [5] Machanavajjhala A., Kifer D., Gehrke J., Et al., L-diversity: privacy beyond k-anonymity, ACM Trans Knowl Discov Data, 1, 1, (2007)
  • [6] Li N., Li T., Venkatasubramanian S.
  • [7] Sangeetha S., Sadasivam G.S.
  • [8] Virupaksha S., Venkatesulu D., Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data, Int J Comput Appl, 2, 4, pp. 1-10, (2019)
  • [9] Mehmood A., Natgunanathan I., Xiang Y., Et al., Protection of big data privacy, IEEE Access, 4, pp. 1821-1834, (2016)
  • [10] Karlekar N.P., Gomathi N., Ow-svm: ontology and whale optimization-based support vector machine for privacy-preserved medical data classification in cloud, Int J Commun Syst, 31, 12, (2018)