Mining Frequent Itemsets in Data Streams Based on Genetic Algorithm

被引:0
作者
Han, Chong [1 ]
Sun, Lijuan [1 ,2 ,3 ]
Guo, Jian [1 ,2 ,3 ]
Chen, Xiaodong [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210003, Jiangsu, Peoples R China
[2] Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing 210003, Jiangsu, Peoples R China
[3] Minist Educ Jiangsu Prov, Key Lab Broadband Wireless Commun & Sensor Networ, Nanjing 210003, Jiangsu, Peoples R China
来源
2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT) | 2013年
关键词
Big data; data streams; genetic algorithm; frequent itemsets;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stream data is a very common data type in big data and in many data streams applications, users tend to pay more attention to the mode information of the data streams. So mining frequent patterns in data streams is a significative work. Meanwhile, finding frequent itemests in a data set with predefined fixed support threshold could be seen as an optimization problem. In this paper, the problem of frequent itemsets mining is derived as a non-linear optimization problem, then genetic algorithm is adopted to solve it. Through the formal and bitmap representation of frequent itemsets, the non-linear optimization problem is transformed to 0-1 programming. A set of experimental results show that unlike typical Apriori algorithm, the complexity of time and memory space grows exponentially as the support decrease, our proposed algorithm has a high time and space efficiency even with a very low support.
引用
收藏
页码:748 / 753
页数:6
相关论文
共 50 条
[21]   Mining maximal frequent itemsets in data streams based on FP-Tree [J].
Ao, Fujiang ;
Yan, Yuejin ;
Huang, Jian ;
Huang, Kedi .
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 :479-+
[22]   A survey on algorithms for mining frequent itemsets over data streams [J].
Cheng, James ;
Ke, Yiping ;
Ng, Wilfred .
KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (01) :1-27
[23]   Frequent Itemsets Mining in Data Streams Using Reconfigurable Hardware [J].
Bustio, Lazaro ;
Cumplido, Rene ;
Hernandez, Raudel ;
Bande, Jose M. ;
Feregrino, Claudia .
NEW FRONTIERS IN MINING COMPLEX PATTERNS, 2016, 9607 :32-45
[24]   Mining frequent itemsets in data streams within a time horizon [J].
Troiano, Luigi ;
Scibelli, Giacomo .
DATA & KNOWLEDGE ENGINEERING, 2014, 89 :21-37
[25]   A survey on algorithms for mining frequent itemsets over data streams [J].
James Cheng ;
Yiping Ke ;
Wilfred Ng .
Knowledge and Information Systems, 2008, 16 :1-27
[26]   Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams [J].
Kim, Younghee ;
Kim, Wonyoung ;
Kim, Ungmo .
JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2010, 6 (01) :79-90
[27]   Efficient strategies for incremental mining of frequent closed itemsets over data streams [J].
Liu, Junqiang ;
Ye, Zhousheng ;
Yang, Xiangcai ;
Wang, Xueling ;
Shen, Linjie ;
Jiang, Xiaoning .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
[28]   The Algorithm of Mining Frequent Itemsets Based on MapReduce [J].
He, Bo .
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND ENGINEERING APPLICATION, ICSCTEA 2013, 2014, 250 :529-534
[29]   Mining maximal frequent itemsets in a sliding window over data streams [J].
Mao Y. ;
Li H. ;
Yang L. ;
Liu L. .
Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11) :1142-1148
[30]   A Mining Maximal Frequent Itemsets over the Entire History of Data Streams [J].
Mao, Yinmin ;
Li, Hong ;
Yang, Lumin ;
Chen, Zhigang ;
Liu, Lixin .
FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, :413-417