De-Identification of Electronic Health Records Data

Cited by: 0
Authors
Borowik, Piotr [1]
Brylicki, Piotr [2]
Dzieciatko, Mariusz [1]
Jeda, Waldemar [3, 4]
Leszewski, Lukasz [1]
Zajac, Piotr [1]
Affiliations
[1] SAS Inst, Ul Gdanska 27-31, PL-01633 Warsaw, Poland
[2] Maria Sklodowska Curie Mem Canc Ctr & Inst Oncol, Ul Roentgena 5, PL-02781 Warsaw, Poland
[3] Warsaw Sch Informat Technol, Ul Newelska 6, PL-01447 Warsaw, Poland
[4] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
Source
INFORMATION TECHNOLOGY IN BIOMEDICINE | 2019 / Vol. 1011
Keywords
EHR; Data anonymization; De-identification; Data quality; SAS; DataFlux; SECURITY; PRIVACY; CARE; ANONYMIZATION
DOI
10.1007/978-3-030-23762-2_29
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
ONKO.SYS is an IT infrastructure platform that includes a Data Warehouse module and serves cancer research in Warsaw, Poland. Electronic health records are made available for scientific purposes, and data items that allow persons to be identified have to be encoded or removed. The paper explains the sources of personal data and their patterns, especially in doctors' text notes. It also describes the implementation of the personal data identification process for structured data and for unstructured text notes. The text note de-identification system is built within the framework of the SAS Institute DataFlux commercial software package.
Pages: 325-337
Page count: 13