Pre-large based high utility pattern mining for transaction insertions in incremental database

Cited by: 26
Authors
Kim, Hyeonmo [1]
Lee, Chanhee [1]
Ryu, Taewoong [1]
Kim, Heonho [1]
Kim, Sinyoung [1]
Vo, Bay [2]
Lin, Jerry Chun-Wei [3]
Yun, Unil [1]
Affiliations
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] HUTECH Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Western Norway Univ Appl Sci, Dept Comp Sci Elect Engn & Math Sci, Bergen, Norway
Funding
National Research Foundation of Singapore;
Keywords
Data mining; Pattern mining; High-utility pattern; Pre-large; Incremental database; ALGORITHM; ITEMSETS;
DOI
10.1016/j.knosys.2023.110478
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High utility pattern mining has been actively researched and applied to diverse applications because it considers both the quantity and the importance of items when processing a database. However, traditional high utility pattern mining methods target static databases, so they cannot meet the requirements of users who need to process dynamic environments. Although methods for incremental databases have been proposed, they are limited in that they run the mining process over the entire database, including already processed data, whenever new data are inserted. The pre-large concept is one technique for processing dynamic databases. Using the pre-large technique, we can handle transaction insertions efficiently by reusing the patterns extracted in the previous mining process. In this paper, we propose a novel pre-large-based approach to discover high utility patterns from incremental databases. A list structure is proposed to store the utility information of patterns, so candidate patterns are not generated and no additional database scan is required. Performance evaluation on various real and synthetic datasets shows that the proposed algorithm is more efficient and effective than the latest approaches in a dynamic environment. (c) 2023 Elsevier B.V. All rights reserved.
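The pre-large idea mentioned in the abstract can be illustrated with a small sketch. This is not the paper's algorithm or its list structure, neither of which the abstract describes in detail. It only assumes the usual reading of pre-large: an upper and a lower utility threshold, with patterns between the two kept as "pre-large" so that inserted transactions can update stored totals without rescanning the original data. The toy database, the profit table, the threshold values, and all function names are hypothetical.

```python
# Hypothetical toy data: each transaction maps item -> purchased quantity.
original_db = [
    {"a": 2, "b": 1},
    {"a": 1, "c": 3},
    {"b": 2, "c": 1},
]
# Hypothetical unit profits (the "importance" of each item).
profit = {"a": 5, "b": 2, "c": 1}


def transaction_utility(tx):
    """Total utility of one transaction: quantity times unit profit."""
    return sum(qty * profit[item] for item, qty in tx.items())


def pattern_utility(pattern, db):
    """Sum the pattern's utility over the transactions that contain it."""
    total = 0
    for tx in db:
        if all(item in tx for item in pattern):
            total += sum(tx[item] * profit[item] for item in pattern)
    return total


def classify(utility, total_utility, upper, lower):
    """Assumed pre-large rule: compare the utility ratio to two thresholds."""
    ratio = utility / total_utility
    if ratio >= upper:
        return "large"
    if ratio >= lower:
        return "pre-large"
    return "small"


UPPER, LOWER = 0.4, 0.2  # hypothetical thresholds

# First mining pass over the original database.
total = sum(transaction_utility(tx) for tx in original_db)
utilities = {p: pattern_utility(p, original_db) for p in [("a",), ("b",), ("c",)]}
before = {p: classify(u, total, UPPER, LOWER) for p, u in utilities.items()}

# Keep only large and pre-large patterns for later updates.
stored = {p: u for p, u in utilities.items() if before[p] != "small"}

# Insertion: update stored totals from the new transactions only.
inserted = [{"b": 5}]
total += sum(transaction_utility(tx) for tx in inserted)
for p in stored:
    stored[p] += pattern_utility(p, inserted)
after = {p: classify(u, total, UPPER, LOWER) for p, u in stored.items()}
```

In this toy run, the pattern `("b",)` starts as pre-large and becomes large after the insertion, using only its stored utility and the new transaction. A real pre-large method also has to decide when accumulated insertions make a rescan of the original database necessary, which this sketch leaves out.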
Pages: 24