Mining Frequent Itemsets in Data Streams Based on Genetic Algorithm

被引:0
作者
Han, Chong [1 ]
Sun, Lijuan [1 ,2 ,3 ]
Guo, Jian [1 ,2 ,3 ]
Chen, Xiaodong [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210003, Jiangsu, Peoples R China
[2] Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing 210003, Jiangsu, Peoples R China
[3] Minist Educ Jiangsu Prov, Key Lab Broadband Wireless Commun & Sensor Networ, Nanjing 210003, Jiangsu, Peoples R China
来源
2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT) | 2013年
关键词
Big data; data streams; genetic algorithm; frequent itemsets;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stream data is a very common data type in big data and in many data streams applications, users tend to pay more attention to the mode information of the data streams. So mining frequent patterns in data streams is a significative work. Meanwhile, finding frequent itemests in a data set with predefined fixed support threshold could be seen as an optimization problem. In this paper, the problem of frequent itemsets mining is derived as a non-linear optimization problem, then genetic algorithm is adopted to solve it. Through the formal and bitmap representation of frequent itemsets, the non-linear optimization problem is transformed to 0-1 programming. A set of experimental results show that unlike typical Apriori algorithm, the complexity of time and memory space grows exponentially as the support decrease, our proposed algorithm has a high time and space efficiency even with a very low support.
引用
收藏
页码:748 / 753
页数:6
相关论文
共 50 条
[41]   An Improved Algorithm for Frequent Itemsets Mining [J].
Jiang, Hao ;
He, Xu .
2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, :314-317
[42]   Mining maximal frequent itemsets in uncertain data [J].
Tang, Xianghong ;
Yang, Quanwei ;
Zheng, Yang .
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2015, 43 (09) :29-34
[43]   The Mining Algorithm of Frequent Itemsets based on Mapreduce and FP-tree [J].
He, Bo ;
Zhang, Hongyuan ;
Pei, Jianhui .
2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, :108-111
[44]   Finding frequent itemsets over online data streams [J].
Chang, Joong Hyuk ;
Lee, Won Suk .
INFORMATION AND SOFTWARE TECHNOLOGY, 2006, 48 (07) :606-618
[45]   Approximate mining of maximal frequent itemsets in data streams with different window models [J].
Li, Hua-Fu ;
Lee, Suh-Yin .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) :781-789
[46]   On the design of hardware-software architectures for frequent itemsets mining on data streams [J].
Lázaro Bustio-Martínez ;
René Cumplido ;
Raudel Hernández-León ;
José M. Bande-Serrano ;
Claudia Feregrino-Uribe .
Journal of Intelligent Information Systems, 2018, 50 :415-440
[47]   Mining Recent Maximal Frequent Itemsets Over Data Streams with Sliding Window [J].
Cai, Saihua ;
Hao, Shangbo ;
Sun, Ruizhi ;
Wu, Gang .
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (06) :961-969
[48]   On the design of hardware-software architectures for frequent itemsets mining on data streams [J].
Bustio-Martinez, Lazaro ;
Cumplido, Rene ;
Hernandez-Leon, Raudel ;
Bande-Serrano, Jose M. ;
Feregrino-Uribe, Claudia .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (03) :415-440
[49]   An integrated cuckoo search-genetic algorithm for mining frequent itemsets [J].
Sukanya, N. S. ;
Thangaiah, P. Ranjit Jeba .
JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (03) :671-690
[50]   A genetic algorithm based searching of maximal frequent itemsets [J].
Huang, JP ;
Yang, CT ;
Fu, CH .
IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, :548-554