Hybrid time decay model and probability decay window model for data stream closed frequent pattern mining

被引:1
作者
Yang, Rui [1 ]
Ye, Dong [1 ]
机构
[1] Zhengzhou Univ Sci & Technol, Sch Elect Engn, Zhengzhou 450000, Peoples R China
来源
JOURNAL OF APPLIED SCIENCE AND ENGINEERING | 2020年 / 23卷 / 04期
关键词
Data stream; Frequent pattern mining; Time decay model; Probability decay window model; Closure opera-tor; Decay factor; ALGORITHM; ITEMSETS;
D O I
10.6180/jase.202012_23(4).0005
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data stream is continuous and uncertain. Frequent pattern mining for data stream will cause that data distributes unevenly and concept drift. In order to improve mining efficiency and decrease data storage, we propose a hybrid time decay model and probability decay window model (HTPDWM) for data stream closed frequent pattern mining. This new method is divided into three steps. First, we adopt mining closed frequent pattern of sliding window model and time decay model in data stream to deal with new and old things. Second, we use probability decay window model and closure operator to calculate ex-pected support degree and improve efficiency of close pattern mining respectively. Third, we use decay factor to correct concept drift and data distributes evenly. Finally, we make experiments to verify the ef-fectiveness of the new method. Results show that HTPDWM can present stable with different sliding window and have better performance when processing time and memory space.
引用
收藏
页码:611 / 618
页数:8
相关论文
共 14 条
[11]   An efficient algorithm for incrementally mining frequent closed itemsets [J].
Yen, Show-Jane ;
Lee, Yue-Shi ;
Wang, Chiu-Kuang .
APPLIED INTELLIGENCE, 2014, 40 (04) :649-668
[12]   Large Scale Remote Sensing Image Segmentation Based on Fuzzy Region Competition and Gaussian Mixture Mlodel [J].
Yin, Shoulin ;
Zhang, Ye ;
Karim, Shahid .
IEEE ACCESS, 2018, 6 :26069-26080
[13]   A false negative approach to mining frequent itemsets from high speed transactional data streams [J].
Yu, Jeffrey Xu ;
Chong, Zhihong ;
Lu, Hongjun ;
Zhang, Zhenjie ;
Zhou, Aoying .
INFORMATION SCIENCES, 2006, 176 (14) :1986-2015
[14]   A distributed frequent itemset mining algorithm using Spark for Big Data analytics [J].
Zhang, Feng ;
Liu, Min ;
Gui, Feng ;
Shen, Weiming ;
Shami, Abdallah ;
Ma, Yunlong .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04) :1493-1501