An efficient algorithm for mining high utility patterns from incremental databases with one database scan

被引:74
作者
Yun, Unil [1 ]
Ryang, Heungmo [1 ]
Lee, Gangin [1 ]
Fujita, Hamido [2 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Iwate Prefectural Univ, Fac Software & Informat Sci, Iwate, Japan
基金
新加坡国家研究基金会;
关键词
Data mining; High utility patterns; One database scan; Incremental mining; Utility mining; TRANSACTION DELETION; ITEMSETS; FREQUENT;
D O I
10.1016/j.knosys.2017.03.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility pattern mining has been actively researched as one of the significant topics in the data mining field since this approach can solve the limitation of traditional pattern mining that cannot fully consider characteristics of real world databases. Moreover, database volumes have been bigger gradually in various applications such as sales data of retail markets and connection information of web services, and general methods for static databases are not suitable for processing dynamic databases and extracting useful information from them. Although incremental utility pattern mining approaches have been suggested, previous approaches need at least two scans for incremental utility pattern mining irrespective of using any structure. However, the approaches with multiple scans are actually not adequate for stream environments. In this paper, we propose an efficient algorithm for mining high utility patterns from incremental databases with one database scan based on a list-based data structure without candidate generation. Experimental results with real and synthetic datasets show that the proposed algorithm outperforms previous one phase construction methods with candidate generation. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:188 / 206
页数:19
相关论文
共 46 条
[31]   High utility pattern mining over data streams with sliding window technique [J].
Ryang, Heungmo ;
Yun, Unil .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 57 :214-231
[32]   Fast algorithm for high utility pattern mining with the sum of item quantities [J].
Ryang, Heungmo ;
Yun, Unil ;
Ryu, Keun Ho .
INTELLIGENT DATA ANALYSIS, 2016, 20 (02) :395-415
[33]   An efficient approach for mining association rules from high utility itemsets [J].
Sahoo, Jayakrushna ;
Das, Ashok Kumar ;
Goswami, A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) :5754-5778
[34]   Binary partition for itemsets expansion in mining high utility itemsets [J].
Song, Wei ;
Wang, Chunhua ;
Li, Jinhong .
INTELLIGENT DATA ANALYSIS, 2016, 20 (04) :915-931
[35]   Mining high utility itemsets by dynamically pruning the tree structure [J].
Song, Wei ;
Liu, Yu ;
Li, Jinhong .
APPLIED INTELLIGENCE, 2014, 40 (01) :29-43
[36]   Mining frequent itemsets in data streams within a time horizon [J].
Troiano, Luigi ;
Scibelli, Giacomo .
DATA & KNOWLEDGE ENGINEERING, 2014, 89 :21-37
[37]  
Tseng V.S., 2017, DATA SCI PATTERN REC, V1, P13
[38]   Efficient Algorithms for Mining Top-K High Utility Itemsets [J].
Tseng, Vincent S. ;
Wu, Cheng-Wei ;
Fournier-Viger, Philippe ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) :54-67
[39]   Efficient Algorithms for Mining the Concise and Lossless Representation of High Utility Itemsets [J].
Tseng, Vincent S. ;
Wu, Cheng-Wei ;
Fournier-Viger, Philippe ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) :726-739
[40]   Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases [J].
Tseng, Vincent S. ;
Shie, Bai-En ;
Wu, Cheng-Wei ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (08) :1772-1786