An efficient algorithm for mining high utility patterns from incremental databases with one database scan

被引：74

作者：

Yun, Unil ^{[1
]}

Ryang, Heungmo ^{[1
]}

Lee, Gangin ^{[1
]}

Fujita, Hamido ^{[2
]}

机构：

[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea

[2] Iwate Prefectural Univ, Fac Software & Informat Sci, Iwate, Japan

来源：

KNOWLEDGE-BASED SYSTEMS | 2017年 / 124卷

基金：

新加坡国家研究基金会;

关键词：

Data mining; High utility patterns; One database scan; Incremental mining; Utility mining; TRANSACTION DELETION; ITEMSETS; FREQUENT;

D O I：

10.1016/j.knosys.2017.03.016

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High utility pattern mining has been actively researched as one of the significant topics in the data mining field since this approach can solve the limitation of traditional pattern mining that cannot fully consider characteristics of real world databases. Moreover, database volumes have been bigger gradually in various applications such as sales data of retail markets and connection information of web services, and general methods for static databases are not suitable for processing dynamic databases and extracting useful information from them. Although incremental utility pattern mining approaches have been suggested, previous approaches need at least two scans for incremental utility pattern mining irrespective of using any structure. However, the approaches with multiple scans are actually not adequate for stream environments. In this paper, we propose an efficient algorithm for mining high utility patterns from incremental databases with one database scan based on a list-based data structure without candidate generation. Experimental results with real and synthetic datasets show that the proposed algorithm outperforms previous one phase construction methods with candidate generation. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：188 / 206

页数：19

共 46 条

[31] High utility pattern mining over data streams with sliding window technique [J].