Distributed Cooperative Coevolution of Data Publishing Privacy and Transparency

被引:22
作者
Ge, Yong-Feng [1 ]
Bertino, Elisa [2 ]
Wang, Hua [1 ]
Cao, Jinli [3 ]
Zhang, Yanchun [1 ,4 ]
机构
[1] Victoria Univ, 70-104 Ballarat Rd, Footscray, Vic 3011, Australia
[2] Purdue Univ, 610 Purdue Mall, W Lafayette, IN 47907 USA
[3] La Trobe Univ, Plenty Rd, Melbourne, Vic 3086, Australia
[4] Peng Cheng Lab, 2 Xingke First St, Shenzhen 518066, Guangdong, Peoples R China
关键词
Large-scale multi-objective optimization; data privacy and transparency; genetic algorithm; cooperative coevolution; MULTIOBJECTIVE EVOLUTIONARY ALGORITHM; DATA PUBLICATION; OPTIMIZATION; ANONYMITY; NOISE;
D O I
10.1145/3613962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data transparency is beneficial to data participants' awareness, users' fairness, and research work's reproducibility. However, when addressing transparency requirements, we cannot ignore data privacy. This article defines the multi-objective data publishing (MODP) problem, optimizing data privacy and transparency at the same time. Accordingly, we propose a distributed cooperative coevolutionary genetic algorithm (DCCGA) to optimize the MODP problem. In the population of DCCGA, each individual represents an anonymization solution to MODP. Three modules in DCCGA, i.e., grouping module, cooperative coevolutionary module, and evolving module, are proposed for distributed sub-population update and evaluation, improving DCCGA's optimization performance and parallel efficiency. Moreover, a matrix-based crossover operator and a matrix-based mutation operator are designed to exchange and adjust anonymization information in the individuals efficiently. Experimental results demonstrate that the proposed DCCGA outperforms the competitors with respect to solution accuracy, convergence speed, and scalability. Besides, we verify the effectiveness of all the proposed components in DCCGA.
引用
收藏
页数:23
相关论文
共 67 条
  • [1] Data Transparency and Fairness Analysis of the NYPD Stop-and-Frisk Program
    Badr, Youakim
    Sharma, Rahul
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2022, 14 (02):
  • [2] Editorial: Special Issue on Data Transparency-Uses Cases and Applications
    Barhamgi, Mahmoud
    Bertino, Elisa
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2022, 14 (02):
  • [3] Synthesis of Longitudinal Human Location Sequences: Balancing Utility and Privacy
    Benarous, Maya
    Toch, Eran
    Ben-gal, Irad
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (06)
  • [4] The Quest for Data Transparency
    Bertino, Elisa
    [J]. IEEE SECURITY & PRIVACY, 2020, 18 (03) : 67 - 68
  • [5] Cameron K., 2005, Microsoft Corp, V12, P8
  • [6] Applying graph-based differential grouping for multiobjective large-scale optimization
    Cao, Bin
    Zhao, Jianwei
    Gu, Yu
    Ling, Yingbiao
    Ma, Xiaoliang
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2020, 53 (53)
  • [7] A Distributed Parallel Cooperative Coevolutionary Multiobjective Evolutionary Algorithm for Large-Scale Optimization
    Cao, Bin
    Zhao, Jianwei
    Lv, Zhihan
    Liu, Xin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (04) : 2030 - 2038
  • [8] Achieving Transparency Report Privacy in Linear Time
    Chen, Chien-Lun
    Golubchik, Leana
    Pal, Ranjan
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2022, 14 (02):
  • [9] CenEEGs: Valid EEG Selection for Classification
    Dai, Chenglong
    Pi, Dechang
    Becker, Stefanie, I
    Wu, Jia
    Cui, Lin
    Johnson, Blake
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (02)
  • [10] Brain EEG Time-Series Clustering Using Maximum-Weight Clique
    Dai, Chenglong
    Wu, Jia
    Pi, Dechang
    Becker, Stefanie, I
    Cui, Lin
    Zhang, Qin
    Johnson, Blake
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 357 - 371