Association rule hiding based on evolutionary multi-objective optimization

Cited by: 19
Authors
Cheng, Peng [1, 4]
Lee, Ivan [2]
Lin, Chun-Wei [1]
Pan, Jeng-Shyang [1, 3]
Affiliations
[1] Harbin Inst Technol, Shenzhen Grad Sch, Shenzhen, Guangdong, Peoples R China
[2] Univ S Australia, Sch IT & Math Sci, Adelaide, SA 5001, Australia
[3] Fujian Univ Technol, Coll Informat Sci & Engn, Fuzhou, Fujian, Peoples R China
[4] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
Keywords
Privacy preserving data mining; association rule hiding; evolutionary multi-objective optimization; EMO; ALGORITHMS;
DOI
10.3233/IDA-160817
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When data mining techniques are applied to discover useful knowledge in a large data collection, they are often required to preserve confidential information, such as sensitive frequent itemsets and rules. A feasible way to ensure confidentiality is to sanitize the database so that sensitive information is concealed. However, the sanitization process often produces side effects, so minimizing these side effects is an important task. An important but often ignored fact is that a tradeoff exists among the different side effects: improving performance on one dimension often degrades performance on others. In this paper, we focus on privacy preservation in association rule mining. Because of this tradeoff, we minimize the side effects from the viewpoint of multi-objective optimization. We propose a rule hiding approach based on evolutionary multi-objective optimization (EMO). It hides sensitive rules by removing identified items. The side effects, namely missing non-sensitive rules, ghost rules, and data loss, are formulated as optimization objectives. EMO is used to find a suitable subset of transactions to modify so that the side effects are minimized. Experimental results on real datasets show that the proposed approach achieves satisfactory results with fewer side effects. In addition, the EMO-based approach can produce multiple hiding solutions in a single run, which lets the user choose among them according to preference or experience.
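The side effects named in the abstract can be illustrated on a toy database. The following is a minimal sketch, not the paper's implementation: the brute-force rule miner, the function names (`mine_rules`, `sanitize`, `side_effects`, `dominates`), the six-transaction database, and the thresholds are all assumptions made for illustration, and the hiding-failure count is reported alongside the three objectives the abstract lists. A candidate solution is represented as the set of transaction indices from which a chosen "victim" item is removed, and two candidates are compared by Pareto dominance, the ordering an EMO algorithm such as NSGA-II relies on.

```python
from itertools import combinations

def support(db, itemset):
    """Fraction of transactions that contain every item of itemset."""
    return sum(1 for t in db if itemset <= t) / len(db)

def mine_rules(db, min_sup, min_conf):
    """Brute-force miner (toy sizes only): all rules lhs -> rhs meeting both thresholds."""
    items = sorted(set().union(*db))
    rules = set()
    for k in range(2, len(items) + 1):
        for combo in combinations(items, k):
            itemset = frozenset(combo)
            sup = support(db, itemset)
            if sup == 0 or sup < min_sup:
                continue
            for r in range(1, k):
                for lhs in map(frozenset, combinations(combo, r)):
                    if sup / support(db, lhs) >= min_conf:
                        rules.add((lhs, itemset - lhs))
    return rules

def sanitize(db, chosen, victim):
    """Remove the victim item from the transactions whose indices are in chosen."""
    return [t - {victim} if i in chosen else t for i, t in enumerate(db)]

def side_effects(db, chosen, victim, sensitive, min_sup, min_conf):
    """Return (hiding failures, missing rules, ghost rules, data loss); lower is better."""
    before = mine_rules(db, min_sup, min_conf)
    clean = sanitize(db, chosen, victim)
    after = mine_rules(clean, min_sup, min_conf)
    hiding_failures = len(sensitive & after)       # sensitive rules still minable
    missing = len((before - sensitive) - after)    # non-sensitive rules lost
    ghost = len(after - before)                    # rules that did not exist before
    data_loss = sum(len(a) - len(b) for a, b in zip(db, clean))  # items deleted
    return hiding_failures, missing, ghost, data_loss

def dominates(a, b):
    """Pareto dominance for minimization: a is no worse everywhere and better somewhere."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

# Hypothetical toy data: hide the rule a -> b by deleting item b.
DB = [frozenset(t) for t in ("abc", "ab", "ab", "ac", "bc", "c")]
SENSITIVE = {(frozenset("a"), frozenset("b"))}
MIN_SUP, MIN_CONF = 0.3, 0.6

if __name__ == "__main__":
    for chosen in (set(), {0}, {1}, {0, 1}):
        print(sorted(chosen), side_effects(DB, chosen, "b", SENSITIVE, MIN_SUP, MIN_CONF))
```

In this toy setting, deleting the item from different transactions hides the same rule at different cost, which is why the choice of transactions is treated as a search problem. An evolutionary algorithm would evolve many such index sets and keep the non-dominated ones, so a single run yields several alternative hiding solutions.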
Pages: 495-514
Number of pages: 20