A scalable privacy-preserving framework for temporal record linkage

被引:2
|
作者
Ranbaduge, Thilina [1 ]
Christen, Peter [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
基金
澳大利亚研究理事会;
关键词
Secure multiparty computation; Encryption; Temporal records;
D O I
10.1007/s10115-019-01370-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Record linkage (RL) is the process of identifying matching records from different databases that refer to the same entity. In many applications, it is common that the attribute values of records that belong to the same entity evolve over time, for example people can change their surname or address. Therefore, to identify the records that refer to the same entity over time, RL should make use of temporal information such as the time-stamp of when a record was created and/or update last. However, if RL needs to be conducted on information about people, due to privacy and confidentiality concerns organisations are often not willing or allowed to share sensitive data in their databases, such as personal medical records or location and financial details, with other organisations. This paper proposes a scalable framework for privacy-preserving temporal record linkage that can link different databases while ensuring the privacy of sensitive data in these databases. We propose two protocols that can be used in different linkage scenarios with and without a third party. Our protocols use Bloom filter encoding which incorporates the temporal information available in records during the linkage process. Our approaches first securely calculate the probabilities of entities changing attribute values in their records over a period of time. Based on these probabilities, we then generate a set of masking Bloom filters to adjust the similarities between record pairs. We provide a theoretical analysis of the complexity and privacy of our techniques and conduct an empirical study on large real databases containing several millions of records. The experimental results show that our approaches can achieve better linkage quality compared to non-temporal PPRL while providing privacy to individuals in the databases that are being linked.
引用
收藏
页码:45 / 78
页数:34
相关论文
共 50 条
  • [1] A scalable privacy-preserving framework for temporal record linkage
    Thilina Ranbaduge
    Peter Christen
    Knowledge and Information Systems, 2020, 62 : 45 - 78
  • [2] Privacy-Preserving Temporal Record Linkage
    Ranbaduge, Thilina
    Christen, Peter
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 377 - 386
  • [3] ScaDS Research on Scalable Privacy-preserving Record Linkage
    Franke, Martin
    Gladbach, Marcel
    Sehili, Ziad
    Rohde, Florens
    Rahm, Erhard
    Datenbank-Spektrum, 2019, 19 (01): : 31 - 40
  • [4] A Vulnerability Assessment Framework for Privacy-preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (03)
  • [5] Privacy-preserving record linkage
    Verykios, Vassilios S.
    Christen, Peter
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (05) : 321 - 332
  • [6] Privacy-Preserving Record Linkage
    Hall, Rob
    Fienberg, Stephen E.
    PRIVACY IN STATISTICAL DATABASES, 2010, 6344 : 269 - +
  • [7] Semantic privacy-preserving framework for electronic health record linkage
    Lu, Yang
    Sinnott, Richard O.
    TELEMATICS AND INFORMATICS, 2018, 35 (04) : 737 - 752
  • [8] Privacy-Preserving Record Linkage with Spark
    Valkering, Onno
    Belloum, Adam
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 440 - 448
  • [9] FEDERAL: A Framework for Distance-Aware Privacy-Preserving Record Linkage
    Karapiperis, Dimitrios
    Gkoulalas-Divanis, Aris
    Verykios, Vassilios S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (02) : 292 - 304
  • [10] A Practical and Scalable Privacy-preserving Framework
    Avgerinos, Nikos
    D'Antonio, Salvatore
    Kamara, Irene
    Kotselidis, Christos
    Lazarou, Ioannis
    Mannarino, Teresa
    Meditskos, Georgios
    Papachristopoulou, Konstantina
    Papoutsis, Angelos
    Roccetti, Paolo
    Zuber, Martin
    2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 598 - 603