De-Identification of Electronic Health Records Data

被引:0
|
作者
Borowik, Piotr [1 ]
Brylicki, Piotr [2 ]
Dzieciatko, Mariusz [1 ]
Jeda, Waldemar [3 ,4 ]
Leszewski, Lukasz [1 ]
Zajac, Piotr [1 ]
机构
[1] SAS Inst, Ul Gdanska 27-31, PL-01633 Warsaw, Poland
[2] Maria Sklodowska Curie Mem Canc Ctr & Inst Oncol, Ul Roentgena 5, PL-02781 Warsaw, Poland
[3] Warsaw Sch Informat Technol, Ul Newelska 6, PL-01447 Warsaw, Poland
[4] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
来源
关键词
EHR; Data anonymization; De-identyfication; Data quality; SAS; DataFlux; SECURITY; PRIVACY; CARE; ANONYMIZATION;
D O I
10.1007/978-3-030-23762-2_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
ONKO.SYS is an IT infrastructure platform, which consists of Data Warehouse module, with the purpose of cancer research in Warsaw, Poland. Electronic health records are available for scientific purposes and the data items allowing to persons identification have to be encoded or removed. The paper explain sources of personal data and its patterns, especially in doctors' text notes. Also implementation of personal data identification process is described for structural data and for unstructured text notes. The system of text notes de-identification is build in the framework of SAS Institute DataFlux commercial software package.
引用
收藏
页码:325 / 337
页数:13
相关论文
共 50 条
  • [1] Generation of Surrogates for De-Identification of Electronic Health Records
    Chen, Aipeng
    Jonnagaddala, Jitendra
    Nekkantti, Chandini
    Liaw, Siaw-Teng
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 70 - 73
  • [2] Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records
    Libbi, Claudia Alessandra
    Trienes, Jan
    Trieschnigg, Dolf
    Seifert, Christin
    FUTURE INTERNET, 2021, 13 (05)
  • [3] Methods for the de-identification of electronic health records for genomic research
    El Emam, Khaled
    GENOME MEDICINE, 2011, 3
  • [4] Methods for the de-identification of electronic health records for genomic research
    Khaled El Emam
    Genome Medicine, 3
  • [5] Privacy Preserving Neural Networks for Electronic Health Records De-Identification
    Ahmed, Tanbir
    Al Aziz, Md Momin
    Mohammed, Noman
    Jiang, Xiaoqian
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [6] Leveraging text skeleton for de-identification of electronic medical records
    Yue-Shu Zhao
    Kun-Li Zhang
    Hong-Chao Ma
    Kun Li
    BMC Medical Informatics and Decision Making, 18
  • [7] Leveraging text skeleton for de-identification of electronic medical records
    Zhao, Yue-Shu
    Zhang, Kun-Li
    Ma, Hong-Chao
    Li, Kun
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [8] Principle-Based Approach for the De-Identification of Code-Mixed Electronic Health Records
    Wang, Chen-Kai
    Wang, Feng-Duo
    Lee, You-Qian
    Chen, Pei-Tsz
    Wang, Bo-Hong
    Su, Chu-Hsien
    Kuo, Joseph Chin-Chi
    Wu, Chi-Shin
    Chien, Yi-Ling
    Dai, Hong-Jie
    Tseng, Vincent S.
    Hsu, Wen-Lian
    IEEE ACCESS, 2022, 10 : 22875 - 22885
  • [9] Improving domain adaptation in de-identification of electronic health records through self-training
    Liao, Shun
    Kiros, Jamie
    Chen, Jiyang
    Zhang, Zhaolei
    Chen, Ting
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (10) : 2093 - 2100
  • [10] Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records
    Andrea C Fernandes
    Danielle Cloete
    Matthew TM Broadbent
    Richard D Hayes
    Chin-Kuo Chang
    Richard G Jackson
    Angus Roberts
    Jason Tsang
    Murat Soncul
    Jennifer Liebscher
    Robert Stewart
    Felicity Callard
    BMC Medical Informatics and Decision Making, 13