Protecting anonymity in data-driven biomedical science

Cited by: 11
Authors
Kieseberg, Peter [1, 2]
Hobel, Heidelinde [1]
Schrittwieser, Sebastian [3]
Weippl, Edgar [1]
Holzinger, Andreas [2]
Affiliations
[1] Research Unit HCI, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz
Source
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2014 / Vol. 8401
Keywords
Anonymization; Big data; Data-driven sciences; Privacy; Pseudonymization; Safety; Security
DOI
10.1007/978-3-662-43968-5_17
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
With formidable recent improvements in data processing and information retrieval, knowledge discovery/data mining, business intelligence, content analytics and other emerging empirical approaches have enormous potential, particularly for the data-intensive biomedical sciences. For results derived using empirical methods, the underlying data set should be made available, at least to the reviewers during the review process, to ensure the quality of the research, to prevent fraud or errors, and to enable the replication of studies. However, particularly in medicine and the life sciences, this creates a conflict: the disclosure of research data raises considerable privacy concerns, because researchers bear full responsibility for protecting their (volunteer) subjects and must therefore adhere to the respective ethical policies. One solution to this problem lies in protecting sensitive information in medical data sets by applying appropriate anonymization. This paper provides an overview of the most important and well-researched approaches and discusses open research problems in this area, with the goal of serving as a starting point for further investigation. © Springer-Verlag Berlin Heidelberg 2014.
Pages: 301-316
Page count: 15