An efficient algorithm for mining high utility patterns from incremental databases with one database scan

被引:74
作者
Yun, Unil [1 ]
Ryang, Heungmo [1 ]
Lee, Gangin [1 ]
Fujita, Hamido [2 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Iwate Prefectural Univ, Fac Software & Informat Sci, Iwate, Japan
基金
新加坡国家研究基金会;
关键词
Data mining; High utility patterns; One database scan; Incremental mining; Utility mining; TRANSACTION DELETION; ITEMSETS; FREQUENT;
D O I
10.1016/j.knosys.2017.03.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility pattern mining has been actively researched as one of the significant topics in the data mining field since this approach can solve the limitation of traditional pattern mining that cannot fully consider characteristics of real world databases. Moreover, database volumes have been bigger gradually in various applications such as sales data of retail markets and connection information of web services, and general methods for static databases are not suitable for processing dynamic databases and extracting useful information from them. Although incremental utility pattern mining approaches have been suggested, previous approaches need at least two scans for incremental utility pattern mining irrespective of using any structure. However, the approaches with multiple scans are actually not adequate for stream environments. In this paper, we propose an efficient algorithm for mining high utility patterns from incremental databases with one database scan based on a list-based data structure without candidate generation. Experimental results with real and synthetic datasets show that the proposed algorithm outperforms previous one phase construction methods with candidate generation. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:188 / 206
页数:19
相关论文
共 46 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]   Interactive mining of high utility patterns over data streams [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Choi, Ho-Jin .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (15) :11979-11991
[3]   Single-pass incremental and interactive mining for weighted frequent patterns [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Lee, Young-Koo ;
Choi, Ho-Jin .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (09) :7976-7994
[4]   Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Lee, Young-Koo .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) :1708-1721
[5]   Mining frequent items in data stream using time fading model [J].
Chen, Ling ;
Mei, Qingling .
INFORMATION SCIENCES, 2014, 257 :54-69
[6]   UT-Tree: Efficient mining of high utility itemsets from data streams [J].
Feng, Lin ;
Wang, Le ;
Jin, Bo .
INTELLIGENT DATA ANALYSIS, 2013, 17 (04) :585-602
[7]  
Fournier-Viger Philippe, 2014, Foundations of Intelligent Systems. 21st International Symposium, ISMIS 2014. Proceedings: LNCS 8502, P83, DOI 10.1007/978-3-319-08326-1_9
[8]  
Fournier-Viger P., 2015, P 5 INT C ASE BIG DA
[9]  
FournierViger P., 2017, Data Sci. Pattern Recogn, V1, P54, DOI DOI 10.1007/978-3-030-04921-8_4
[10]  
Han JW, 2000, SIGMOD RECORD, V29, P1