Improving healthcare services using source anonymous scheme with privacy preserving distributed healthcare data collection and mining

被引:0
作者
Nikunj Domadiya
Udai Pratap Rao
机构
[1] Sardar Vallabhbhai National Institute of Technology,Department of Computer Engineering
来源
Computing | 2021年 / 103卷
关键词
Healthcare; Data Mining; Privacy; Source Anonymous; Privacy Preserving Data Mining; Healthcare Improvement; 68P20; 68P27; 92C50;
D O I
暂无
中图分类号
学科分类号
摘要
The trends of data mining on healthcare data for improving medical services have increased because of the electronic healthcare record(EHR) system, which collects a massive amount of data on a daily basis. In the current scenario, hospital maintains its EHR system and stores the detailed information of patients. Data mining for healthcare improvement requires the data from all the EHR systems located at a different location to be stored at the central data mining server. Collection of healthcare data at some untrusted central data mining server raises privacy threats. Healthcare data contains patients’ private information and sharing this information for data mining creates privacy issues. Most of the previous research either focused on k-anonymity technique which causes information loss and decreases data mining accuracy or privacy preserving data mining which is focused on only specific data mining technique. We adopt source anonymous technique as privacy preserving scheme and present a novel scheme for healthcare data collection and mining in this paper. Our scheme collects data from all EHR systems without any information loss and stores at a single central data mining server, also ensuring privacy is preserved. Central data mining server helps to analyze the collected data with different data mining techniques (Association rule mining, Classification, Clustering, etc.) without the involvement of EHR systems. Our scheme is collusion resilient against central data mining server and EHR systems. Theoretical and experimental analysis show the efficiency of our scheme in terms of computation and communication cost. The experimental results using Heart disease dataset show the advantage to EHR systems using the proposed approach in terms of disease prediction accuracy.
引用
收藏
页码:155 / 177
页数:22
相关论文
共 95 条
[1]  
Tang PC(2006)Electronic health record systems Biomed Inform 10 447-undefined
[2]  
McDonald CJ(2010)Diagnostic analysis of patients with essential hypertension using association rule mining Healthc Inform Res 16 77-undefined
[3]  
Shin AM(2013)Intelligent heart disease prediction system using data mining techniques Int J Healthc Biomed Res 1 94-undefined
[4]  
Lee IH(2006)Association rule discovery with the train and test approach for heart disease prediction IEEE Trans Inf Technol Biomed 10 334-undefined
[5]  
Lee GH(2020)Medication use and the risk of newly diagnosed diabetes in patients with epilepsy: a data mining application on a healthcare database J Organ End User Comput (JOEUC) 32 93-undefined
[6]  
Park HJ(2009)An expert system for detection of breast cancer based on association rules and neural network Expert syst Appl 36 3465-undefined
[7]  
Park HS(2018)Privacy-preserving association rule mining for horizontally partitioned healthcare data: a case study on the heart diseases Sādhanā 43 127-undefined
[8]  
Yoon KI(2019)Status of health information exchange: a comparison of six countries J Global Health 9 0204279-undefined
[9]  
Lee JJ(2013)Aggregate health data in the United States: Steps toward a public good Health Inf J 19 137-undefined
[10]  
Kim YN(2020)Privacy-preserving data integrity verification by using lightweight streaming authenticated data structures for healthcare cyber-physical system Future Gener Comput Syst 108 1287-undefined