Mining Frequent Itemsets in Data Streams Based on Genetic Algorithm

被引:0
作者
Han, Chong [1 ]
Sun, Lijuan [1 ,2 ,3 ]
Guo, Jian [1 ,2 ,3 ]
Chen, Xiaodong [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210003, Jiangsu, Peoples R China
[2] Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing 210003, Jiangsu, Peoples R China
[3] Minist Educ Jiangsu Prov, Key Lab Broadband Wireless Commun & Sensor Networ, Nanjing 210003, Jiangsu, Peoples R China
来源
2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT) | 2013年
关键词
Big data; data streams; genetic algorithm; frequent itemsets;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stream data is a very common data type in big data and in many data streams applications, users tend to pay more attention to the mode information of the data streams. So mining frequent patterns in data streams is a significative work. Meanwhile, finding frequent itemests in a data set with predefined fixed support threshold could be seen as an optimization problem. In this paper, the problem of frequent itemsets mining is derived as a non-linear optimization problem, then genetic algorithm is adopted to solve it. Through the formal and bitmap representation of frequent itemsets, the non-linear optimization problem is transformed to 0-1 programming. A set of experimental results show that unlike typical Apriori algorithm, the complexity of time and memory space grows exponentially as the support decrease, our proposed algorithm has a high time and space efficiency even with a very low support.
引用
收藏
页码:748 / 753
页数:6
相关论文
共 50 条
[31]   Mining Frequent Itemsets from Online Data Streams: Comparative Study [J].
Nabil, HebaTallah Mohamed ;
Eldin, Ahmed Sharaf ;
Belal, Mohamed Abd El-Fattah .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (07) :117-125
[32]   Using hashing and lexicographic order for Frequent Itemsets Mining on data streams [J].
Bustio-Martinez, Lazaro ;
Letras-Luna, Martin ;
Cumplido, Rene ;
Hernandez-Leon, Raudel ;
Feregrino-Uribe, Claudia ;
Bande-Serrano, Jose M. .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 125 :58-71
[33]   Mining frequent itemsets over data streams using efficient window sliding techniques [J].
Li, Hua-Fu ;
Lee, Suh-Yin .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :1466-1477
[34]   Frequent itemsets mining algorithm based on sort tree [J].
Wang H.-M. ;
Dang Y.-Y. ;
Hu M. ;
Liu D.-Y. .
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2016, 46 (04) :1216-1221
[35]   Mining frequent itemsets Algorithm Based on Compression Matrix [J].
Lin, Zizhi ;
Shu, Sihui .
MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 :3501-3505
[36]   A frequent itemsets mining algorithm based on Spatial Partition [J].
Liu, Tieying ;
Chen, Lirong ;
Wang, Guoguang .
2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 5, 2009, :339-+
[37]   The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree [J].
Mi, Xifeng .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[38]   Sliding Window-based Frequent Itemsets Mining over Data Streams using Tail Pointer Table [J].
Le Wang ;
Lin Feng ;
Bo Jin .
International Journal of Computational Intelligence Systems, 2014, 7 :25-36
[39]   A sliding window algorithm for mining frequent itemsets on data stream [J].
Liu, Junqiang ;
Li, Xiurong .
DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, :637-639
[40]   Sliding Window- based Frequent Itemsets Mining over Data Streams using Tail Pointer Table [J].
Wang, Le ;
Feng, Lin ;
Jin, Bo .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2014, 7 (01) :25-36