Anonymisation Models for Text Data: State of the Art, Challenges and Future Directions

被引:0
|
作者
Lison, Pierre [1 ]
Pilan, Ildiko [1 ]
Sanchez, David [2 ]
Batet, Montserrat [2 ]
Ovrelid, Lilja [3 ]
机构
[1] Norwegian Comp Ctr, Oslo, Norway
[2] Univ Rovira & Virgili, CYBERCAT, UNESCO Chair Data Privacy, Tarragona, Spain
[3] Univ Oslo, Language Technol Grp, Oslo, Norway
关键词
DE-IDENTIFICATION; PRIVACY PROTECTION; INFORMATION; SURROGATES; REDACTION; RELEASE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This position paper investigates the problem of automated text anonymisation, which is a pre-requisite for secure sharing of documents containing sensitive information about individuals. We summarise the key concepts behind text anonymisation and provide a review of current approaches. Anonymisation methods have so far been developed in two fields with little mutual interaction, namely natural language processing and privacy-preserving data publishing. Based on a case study, we outline the benefits and limitations of these approaches and discuss a number of open challenges, such as (1) how to account for multiple types of semantic inferences, (2) how to strike a balance between disclosure risk and data utility and (3) how to evaluate the quality of the resulting anonymisation. We lay out a case for moving beyond sequence labelling models and incorporate explicit measures of disclosure risk into the text anonymisation process.
引用
收藏
页码:4188 / 4203
页数:16
相关论文
共 50 条
  • [31] Text Data Security and Privacy in the Internet of Things: Threats, Challenges, and Future Directions
    Khadam, Umair
    Iqbal, Muhammad Munwar
    Alruily, Meshrif
    Al Ghamdi, Mohammed A.
    Ramzan, Muhammad
    Almotiri, Sultan H.
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [32] Mathematical Models for Immunology: Current State of the Art and Future Research Directions
    Eftimie, Raluca
    Gillard, Joseph J.
    Cantrell, Doreen A.
    BULLETIN OF MATHEMATICAL BIOLOGY, 2016, 78 (10) : 2091 - 2134
  • [33] Mathematical Models for Immunology: Current State of the Art and Future Research Directions
    Raluca Eftimie
    Joseph J. Gillard
    Doreen A. Cantrell
    Bulletin of Mathematical Biology, 2016, 78 : 2091 - 2134
  • [34] Challenges and future directions of SUDEP models
    Gu, JiaXuan
    Shao, WeiHui
    Liu, Lu
    Wang, YuLing
    Yang, Yue
    Zhang, ZhuoYue
    Wu, YaXuan
    Xu, Qing
    Gu, LeYuan
    Zhang, YuanLi
    Shen, Yue
    Zhao, HaiTing
    Zeng, Chang
    Zhang, HongHai
    LAB ANIMAL, 2024, 53 (09) : 226 - 243
  • [35] Hydrogen Sulfide in Diabetic Complications Revisited: The State of the Art, Challenges, and Future Directions
    Sun, Hai-Jian
    Xiong, Si-Ping
    Wang, Zi-Chao
    Nie, Xiao-Wei
    Bian, Jin-Song
    ANTIOXIDANTS & REDOX SIGNALING, 2023, 38 (1-3) : 18 - 44
  • [36] Low bitrate audio coding - State-of-the-art, challenges and future directions
    Brandenburg, K
    2000 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY PROCEEDINGS, VOLS. I & II, 2000, : 594 - 597
  • [37] Radiosurgical thalamotomy for essential tremor: state of the art, current challenges and future directions
    Iorio-Morin, Christian
    Mathieu, David
    Franzini, Andrea
    Hodaie, Mojgan
    Villeneuve, Samuelle-Arianne
    Hamel, Andreanne
    Lozano, Andres M.
    EXPERT REVIEW OF NEUROTHERAPEUTICS, 2024, 24 (06) : 597 - 605
  • [38] Challenges in identifying malnutrition in obesity; An overview of the state of the art and directions for future research
    Mwala, Natasha Nalucha
    Borkent, Jos W.
    van der Meij, Barbara S.
    de van der Schueren, Marian A. E.
    NUTRITION RESEARCH REVIEWS, 2024,
  • [39] Sonic Interactions in Virtual Reality: State of the Art, Current Challenges, and Future Directions
    Serafin, Stefania
    Geronazzo, Michele
    Erkut, Cumhur
    Nilsson, Niels C.
    Nordahl, Rolf
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2018, 38 (02) : 31 - 43
  • [40] Low bitrate audio coding - State-of-the-art, challenges and future directions
    Brandenburg, K
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1 - 4