IME: Efficient list-based method for incremental mining of maximal erasable patterns

被引:4
作者
Davashi, Razieh [1 ,2 ]
机构
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
关键词
Erasable pattern mining; Maximal erasable patterns; Incremental mining; Dynamic data; FREQUENT PATTERNS; ALGORITHM;
D O I
10.1016/j.patcog.2023.110166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Erasable pattern mining can help factories facing a financial crisis increase productivity by identifying and eliminating unprofitable products. The Flag-GenMax-EI algorithm extracts Maximal Erasable Itemsets (MEIs); however, it does not support dynamic data. In practice, many applications create databases incrementally. Using the Flag-GenMax-EI algorithm to mine maximal erasable patterns from incremental databases is clearly very costly because it must be run each time. In this paper, an efficient method called IME is proposed for incremental mining of maximal erasable patterns. IMEI-List and IMEP-List are two new data structures introduced by the proposed method. These lists allow the algorithm to update all tree nodes without rescanning the updated database (original database + new database) and recreating the nodes. This is the first study of incremental mining of maximal erasable patterns. Extensive experimental results on dense and sparse incremental data show that the proposed algorithm improves scalability. It extracts MEIs much faster than the Flag-GenMax-EI algorithm in different modes of database update.
引用
收藏
页数:17
相关论文
共 41 条
[1]  
Agrawal R., 2023, Quest synthetic data generator
[2]  
[Anonymous], 2023, Frequent itemset mining dataset repository
[3]   CD-SPM: Cross-domain book recommendation using sequential pattern mining and rule mining [J].
Anwar, Taushif ;
Uma, V .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (03) :793-800
[4]   TKN: An efficient approach for discovering top-k high utility itemsets with positive or negative profits [J].
Ashraf, Mohamed ;
Abdelkader, Tamer ;
Rady, Sherine ;
Gharib, Tarek F. .
INFORMATION SCIENCES, 2022, 587 :654-678
[5]   Mining erasable itemsets with subset and superset itemset constraints [J].
Bay Vo ;
Tuong Le ;
Pedrycz, Witold ;
Giang Nguyen ;
Baik, Sung Wook .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 69 :50-61
[6]   MWFP-outlier: Maximal weighted frequent-pattern-based approach for detecting outliers from uncertain weighted data streams [J].
Cai, Saihua ;
Li, Li ;
Chen, Jinfu ;
Zhao, Kaiyi ;
Yuan, Gang ;
Sun, Ruizhi ;
Huang, Longxia ;
Sosu, Rexford Nii Ayitey .
INFORMATION SCIENCES, 2022, 591 :195-225
[7]  
Chang Y.I., 2022, Int. J. Mach. Learn. Comput., V12, P236
[8]   Discovering high utility-occupancy patterns from uncertain data [J].
Chen, Chien-Ming ;
Chen, Lili ;
Gan, Wensheng ;
Qiu, Lina ;
Ding, Weiping .
INFORMATION SCIENCES, 2021, 546 :1208-1229
[9]   ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data [J].
Davashi, Razieh .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
[10]   UP-tree & UP-Mine: A fast method based on upper bound for frequent pattern mining from uncertain data [J].
Davashi, Razieh .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106