Novel Iterative Min-Max Clustering to Minimize Information Loss in Statistical Disclosure Control

被引:0
作者
Mahmood, Abdun Naser [1 ]
Kabir, Md Enamul [2 ]
Mustafa, Abdul K. [3 ]
机构
[1] Univ New S Wales, Australian Def Force Acad, Sch Informat Technol & Engn, Canberra, ACT 2600, Australia
[2] Univ Queensland, Sch Human Movement Studies, St Lucia, Qld 4072, Australia
[3] Humber Coll, Sch Appl Technol, Toronto, ON, Canada
来源
INTERNATIONAL CONFERENCE ON SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2014, PT II | 2015年 / 153卷
关键词
Privacy; Microaggregation; Microdata protection; k-anonymity; Disclosure control; K-ANONYMITY; MICROAGGREGATION; PROTECTION; ALGORITHM; PRIVACY;
D O I
10.1007/978-3-319-23802-9_14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, there has been an alarming increase of online identity theft and attacks using personally identifiable information. The goal of privacy preservation is to de-associate individuals from sensitive or microdata information. Microaggregation techniques seeks to protect microdata in such a way that can be published and mined without providing any private information that can be linked to specific individuals. Microaggregation works by partitioning the microdata into groups of at least k records and then replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. The challenge is how to minimize the information loss during the microaggregation process. This paper presents a new microaggregation technique for Statistical Disclosure Control (SDC). It consists of two stages. In the first stage, the algorithm sorts all the records in the data set in a particular way to ensure that during microaggregation very dissimilar observations are never entered into the same cluster. In the second stage an optimal microaggregation method is used to create k-anonymous clusters while minimizing the information loss. It works by taking the sorted data and simultaneously creating two distant clusters using the two extreme sorted values as seeds for the clusters. The performance of the proposed technique is compared against the most recent microaggregation methods. Experimental results using benchmark datasets show that the proposed algorithm has the lowest information loss compared with a basket of techniques in the literature.
引用
收藏
页码:157 / 172
页数:16
相关论文
共 14 条
  • [1] Scalable Min-Max Multi-View Spectral Clustering
    Yang, Ben
    Zhang, Xuetao
    Wu, Jinghan
    Nie, Feiping
    Wang, Fei
    Chen, Badong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (05) : 2918 - 2931
  • [2] Novel Min-Max Reformulations of Linear Inverse Problems
    Sheriff, Mohammed Rayyan
    Chatterjee, Debasish
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23 : 1 - 46
  • [3] Data Clustering Using a Modified Fuzzy Min-Max Neural Network
    Seera, Manjeevan
    Lim, Chee Peng
    Loo, Chu Kiong
    Jain, Lakhmi C.
    SOFT COMPUTING APPLICATIONS, (SOFA 2014), VOL 1, 2016, 356 : 413 - 422
  • [4] Improving the Fuzzy Min-Max neural network performance with an ensemble of clustering trees
    Seera, Manjeevan
    Randhawa, Kuldeep
    Lim, Chee Peng
    NEUROCOMPUTING, 2018, 275 : 1744 - 1751
  • [5] Min-Max Optimal Control of Robot Manipulators Affected by Sensor Faults
    Milic, Vladimir
    Kasac, Josip
    Lukas, Marin
    SENSORS, 2023, 23 (04)
  • [6] Associative Knowledge Graph Using Fuzzy Clustering and Min-Max Normalization in Video Contents
    Kim, Hyun-Jin
    Baek, Ji-Won
    Chung, Kyungyong
    IEEE ACCESS, 2021, 9 : 74802 - 74816
  • [7] Explicit solution of min-max model predictive control for uncertain systems
    Gao, Yu
    Sun, Li Ning
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (04) : 461 - 468
  • [8] A modified fuzzy min-max neural network for data clustering and its application to power quality monitoring
    Seera, Manjeevan
    Lim, Chee Peng
    Loo, Chu Kiong
    Singh, Harapajan
    APPLIED SOFT COMPUTING, 2015, 28 : 19 - 29
  • [9] Power Quality Analysis Using a Hybrid Model of the Fuzzy Min-Max Neural Network and Clustering Tree
    Seera, Manjeevan
    Lim, Chee Peng
    Loo, Chu Kiong
    Singh, Harapajan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) : 2760 - 2767
  • [10] Min-max piecewise constant optimal control for multi-model linear systems
    Miranda, Felix A.
    Castanos, Fernando
    Poznyak, Alexander
    IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2016, 33 (04) : 1157 - 1176