Binary partition for itemsets expansion in mining high utility itemsets

被引:3
作者
Song, Wei [1 ]
Wang, Chunhua [1 ]
Li, Jinhong [1 ]
机构
[1] North China Univ Technol, Coll Comp, Beijing 100144, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; high utility itemsets; binary partition; transaction utility list; key support count;
D O I
10.3233/IDA-160838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility itemset mining has recently emerged to address the limitations of frequent itemset mining. It entails relevance measures to reflect both statistical significance and user expectations. Whether breadth-first or depth-first search algorithms are employed, most methods generate new candidates by 1-extension of existing itemsets (i.e., by adding only one item to verified itemsets to generate new potential candidates). As an alternative to 1-extension, we introduce an expansion method based on binary partition. We then define the transaction utility list and key support count and discuss a new pruning strategy. Based on the new itemset expansion method and pruning strategy, we propose an efficient high utility itemset mining algorithm called BPHUI-Mine (Binary Partition-based High Utility Itemsets Mine). Tests on publicly available datasets show that the proposed algorithm outperforms other state-of-the-art algorithms.
引用
收藏
页码:915 / 931
页数:17
相关论文
共 28 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]   HUC-Prune: an efficient candidate pruning technique to mine high utility patterns [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Lee, Young-Koo .
APPLIED INTELLIGENCE, 2011, 34 (02) :181-198
[3]   RIB: A Robust Itemset-based Bayesian approach to classification [J].
Baralis, Elena ;
Cagliero, Luca .
KNOWLEDGE-BASED SYSTEMS, 2014, 71 :366-375
[4]   Closed and noise-tolerant patterns in n-ary relations [J].
Cerf, Loic ;
Besson, Jeremy ;
Nguyen, Kim-Ngan T. ;
Boulicaut, Jean-Francois .
DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (03) :574-619
[5]  
Chan R, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P19
[6]   Mining fuzzy frequent itemsets for hierarchical document clustering [J].
Chen, Chun-Ling ;
Tseng, Frank S. C. ;
Liang, Tyne .
INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) :193-211
[7]   BISC: A Bitmap Itemset Support Counting Approach for Efficient Frequent Itemset Mining [J].
Chen, Jinlin ;
Xiao, Keli .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
[8]   CTU-Mine: An efficient high utility itemset mining algorithm using the pattern growth approach [J].
Erwin, Alva ;
Gopalan, Raj P. ;
Achuthan, N. R. .
2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, :71-+
[9]   UT-Tree: Efficient mining of high utility itemsets from data streams [J].
Feng, Lin ;
Wang, Le ;
Jin, Bo .
INTELLIGENT DATA ANALYSIS, 2013, 17 (04) :585-602
[10]  
Fournier-Viger Philippe, 2014, Foundations of Intelligent Systems. 21st International Symposium, ISMIS 2014. Proceedings: LNCS 8502, P83, DOI 10.1007/978-3-319-08326-1_9