ElGamal Homomorphic Encryption-Based Privacy Preserving Association Rule Mining on Horizontally Partitioned Healthcare Data

Cited by: 7
Authors
Domadiya N. [1]
Rao U.P. [2]
Affiliations
[1] Computer Engineering Department, L. D. College of Engineering, Ahmedabad
[2] Computer Engineering Department, National Institute of Technology, Surat
Keywords
Association Rule Mining; Breast Cancer Disease; Coronavirus (COVID-19); Data Mining Privacy; Distributed Healthcare Data Mining
DOI
10.1007/s40031-021-00696-1
Abstract
Life-threatening diseases have become a pre-eminent healthcare issue because of their high mortality rates. Healthcare intelligence can lower this mortality by detecting diseases early. Patients' medical data are stored in electronic health record (EHR) systems maintained by healthcare providers. Data mining techniques such as association rule mining can detect a patient's disease from the symptoms recorded in an EHR system. The efficacy of association rule mining improves when it uses global data from multiple EHR systems, but this requires every EHR system to send its healthcare records to a central server. Making personal health information available on an untrusted server may violate several privacy laws. As a result, privacy-preserving distributed healthcare data mining has become a well-known research field. This research uses an efficient ElGamal homomorphic encryption technique to protect privacy in distributed association rule mining. The proposed approach discovers the risk factors of life-threatening diseases such as breast cancer and heart disease from their symptoms, and the paper discusses the scope for combating COVID-19. Theoretical analysis shows that the proposed approach is efficient and maintains privacy in an insecure communication environment. An experimental study with a real dataset shows the benefit of the proposed approach compared with the results from a single local EHR system. © 2021, The Institution of Engineers (India).
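The abstract does not say which ElGamal variant or protocol the paper uses. As a rough illustration only, the sketch below assumes the exponential (additively homomorphic) variant: each EHR site encrypts its local support count for an itemset, the ciphertexts are multiplied component-wise, and only the key holder recovers the global count. The group parameters `P`, `Q`, `G`, the function names, and the sample counts are all hypothetical, and the toy prime is far too small for real use.

```python
import secrets

# Toy parameters (assumption, for illustration only): P = 2*Q + 1 is a safe prime
# and G = 4 generates the subgroup of prime order Q. Real deployments need
# groups of cryptographic size.
P, Q, G = 2039, 1019, 4

def keygen():
    """Return (private key x, public key h = G^x mod P)."""
    x = secrets.randbelow(Q - 1) + 1          # x in 1..Q-1
    return x, pow(G, x, P)

def encrypt(h, m):
    """Exponential ElGamal: encode the count m as G^m, then mask it with h^r."""
    r = secrets.randbelow(Q - 1) + 1
    return pow(G, r, P), (pow(G, m, P) * pow(h, r, P)) % P

def add(c1, c2):
    """Component-wise product of ciphertexts encrypts the sum of the plaintexts."""
    return (c1[0] * c2[0]) % P, (c1[1] * c2[1]) % P

def decrypt(x, c, max_m=Q - 1):
    """Strip the mask, then recover m from G^m by brute-force search."""
    a, b = c
    gm = (b * pow(a, P - 1 - x, P)) % P       # a^(P-1-x) equals a^(-x) mod P
    acc = 1
    for m in range(max_m + 1):
        if acc == gm:
            return m
        acc = (acc * G) % P
    raise ValueError("plaintext outside searchable range")

# Hypothetical local support counts for one itemset at three EHR sites.
local_counts = [120, 95, 210]
sk, pk = keygen()
total_ct = encrypt(pk, 0)
for count in local_counts:
    total_ct = add(total_ct, encrypt(pk, count))
global_support = decrypt(sk, total_ct)
```

In this sketch the party that combines the ciphertexts never sees an individual site's count; it only handles masked values, and the brute-force step limits the scheme to sums smaller than the subgroup order.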
Pages: 817-830
Number of pages: 13