Processing Text for Privacy: An Information Flow Perspective

被引:6
|
作者
Fernandes, Natasha [1 ]
Dras, Mark [1 ]
McIver, Annabelle [1 ]
机构
[1] Macquarie Univ, Dept Comp, N Ryde, NSW, Australia
来源
FORMAL METHODS | 2018年 / 10951卷
基金
澳大利亚研究理事会;
关键词
Refinement; Information flow; Privacy; Probabilistic semantics; Text processing; Author anonymity; Author obfuscation; AUTHORSHIP; MODEL; RISK;
D O I
10.1007/978-3-319-95582-7_1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The problem of text document obfuscation is to provide an automated mechanism which is able to make accessible the content of a text document without revealing the identity of its writer. This is more challenging than it seems, because an adversary equipped with powerful machine learning mechanisms is able to identify authorship (with good accuracy) where, for example, the name of the author has been redacted. Current obfuscation methods are ad hoc and have been shown to provide weak protection against such adversaries. Differential privacy, which is able to provide strong guarantees of privacy in some domains, has been thought not to be applicable to text processing. In this paper we will review obfuscation as a quantitative information flow problem and explain how generalised differential privacy can be applied to this problem to provide strong anonymisation guarantees in a standard model for text processing.
引用
收藏
页码:3 / 21
页数:19
相关论文
共 50 条