PMDG: Privacy for Multi-perspective Process Mining Through Data Generalization

被引:1
|
作者
Hildebrant, Ryan [1 ]
Fahrenkrog-Petersen, Stephan A. [2 ,3 ]
Weidlich, Matthias [2 ]
Ren, Shangping [4 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Humboldt Univ, Berlin, Germany
[3] Weizenbaum Inst Networked Soc, Berlin, Germany
[4] San Diego State Univ, San Diego, CA 92182 USA
来源
ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2023 | 2023年 / 13901卷
关键词
Privatization; K-anonymity; Attribute Generalization; DIFFERENTIAL PRIVACY; ALIGNMENT;
D O I
10.1007/978-3-031-34560-9_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anonymization of event logs facilitates process mining while protecting sensitive information of process stakeholders. Existing techniques, however, focus on the privatization of the control-flow. Other process perspectives, such as roles, resources, and objects are neglected or subject to randomization, which breaks the dependencies between the perspectives. Hence, existing techniques are not suited for advanced process mining tasks, e.g., social network mining or predictive monitoring. To address this gap, we propose PMDG, a framework to ensure privacy for multi-perspective process mining through data generalization. It provides group-based privacy guarantees for an event log, while preserving the characteristic dependencies between the control-flow and further process perspectives. Unlike existing privatization techniques that rely on data suppression or noise insertion, PMDG adopts data generalization: a technique where the activities and attribute values referenced in events are generalized into more abstract ones, to obtain equivalence classes that are sufficiently large from a privacy point of view. We demonstrate empirically that PMDG outperforms state-of-the-art anonymization techniques, when mining handovers and predicting outcomes.
引用
收藏
页码:506 / 521
页数:16
相关论文
共 6 条
  • [1] Privacy-Preserving Data Publishing in Process Mining
    Rafiei, Majid
    van der Aalst, Wil M. P.
    BUSINESS PROCESS MANAGEMENT FORUM, BPM FORUM 2020, 2020, 392 : 122 - 138
  • [2] Generalization-based privacy preservation and discrimination prevention in data publishing and mining
    Hajian, Sara
    Domingo-Ferrer, Josep
    Farras, Oriol
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (5-6) : 1158 - 1188
  • [3] Multi-perspective quality control of Illumina RNA sequencing data analysis
    Sheng, Quanhu
    Vickers, Kasey
    Zhao, Shilin
    Wang, Jing
    Samuels, David C.
    Koues, Olivia
    Shyr, Yu
    Guo, Yan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2017, 16 (04) : 194 - 204
  • [4] Multi-Attribute Generalization Method in Privacy Preserving Data Publishing
    Yu Wen-bing
    Pin, L. V.
    Chen Nian-sheng
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 319 - 322
  • [5] Multi-perspective quality control of Illumina exome sequencing data using QC3
    Guo, Yan
    Zhao, Shilin
    Sheng, Quanhu
    Ye, Fei
    Li, Jiang
    Lehmann, Brian
    Pietenpol, Jennifer
    Samuels, David C.
    Shyr, Yu
    GENOMICS, 2014, 103 (5-6) : 323 - 328
  • [6] Privacy-preserving data-mining through micro-aggregation for web-based e-commerce
    Navarro-Arribas, Guillermo
    Torra, Vicenc
    INTERNET RESEARCH, 2010, 20 (03) : 366 - 384