A sequential patterns mining incremental algorithm PIN-Prefixspan based on prefix analysis

被引:0
作者
Wu, Di [1 ]
Ren, Jiadong [2 ]
机构
[1] Department of Information and Electronic Engineering, Hebei University of Engineering
[2] College of Information Science and Engineering, Yanshan University
来源
Advances in Information Sciences and Service Sciences | 2012年 / 4卷 / 19期
关键词
Incremental mining; Prefix; Prefixspan; Sequential pattern mining;
D O I
10.4156/AISS.vol4.issue19.7
中图分类号
学科分类号
摘要
Most previous sequential patterns mining algorithms have spend a long time dealing with the projection database, it will lead to a lot of time cost. In this paper, the concept of the prefix sequence of the sequence x on its prefix sequence y is defined, and a sequential patterns mining algorithm P- Prefixspan based on prefix analysis is presented. According to scanning sequence database SD, all the 1-length sequential patterns are gained. The number of sequential pattern and the minimum support count is compared, if the former is smaller than the latter, it will be abandoned directly, and only the frequent items in the projection database can be constructed. The execution time of mining sequential patterns in the projection database is reduced. Moreover, a prefix-based incremental prefixspan algorithm PIN-Prefixspan is proposed for dealing with dynamic database, the frequent pattern mining results of the original sequence database is used for improving the mining efficiency in the updated sequence database. The experimental results show that PIN-Prefixspan is more efficient in time cost.
引用
收藏
页码:48 / 56
页数:8
相关论文
共 10 条
[1]  
Ren J., Tian Y., He H., Bitmap-based Algorithm of Mining Approximate Sequential Pattern in Data Stream, Journal of AISS, 3, 9, pp. 132-139, (2011)
[2]  
Aggarwal C., Li Y., Wang J., Wang J., Frequent Pattern Mining with Uncertain Data, Proceedings of the 15th ACM SIGKDD International Conference On Knowledge Discovery and Data Mining, pp. 29-38, (2009)
[3]  
Jia Z., Gong Z., Wei Z., Zhang J., Luo S., Xin Y., A Distributed Method on Web Log Sequential Pattern Mining, Journal of IJACT, 4, 6, pp. 24-33, (2012)
[4]  
Dong W., Li M., Huang G., Ren J., Chen L., A Weighted Closed Sequential Patterns Mining Algorithm Based on Regular Expression Constraints, Journal of AISS, 4, 1, pp. 415-422, (2012)
[5]  
Li N., Yao X., Tian D., Mining Temporal Sequential Patterns Based on Multi- granularities, International Journal of Computers, Communications & Control, 7, 3, pp. 471-485, (2012)
[6]  
Kuo R.J., Chao C.M., Liu C.-Y., Integration of K-means algorithm and AprioriSome algorithm for fuzzy sequential pattern mining, International Journal of Applied Soft Computing, 9, 1, pp. 85-93, (2009)
[7]  
Pei J., Han J., Mortazavi-Asl B., Pinto H., Chen Q., Dayal U., Hsu M., PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth, Proceedings of the 17th International Conference On Data Engineering, pp. 215-224, (2001)
[8]  
Saputra D., Rambli D., Oi M.F., Mining Sequential Patterns Using I-PrefixSpan, Proceedings of World Academy of Science, Engineering and Technology, 26, pp. 499-503, (2008)
[9]  
Wang N., Chen L., Pan L., Zhang Y., An Improved Prefixspan Algorithm Based on Time Interval and Click Quantity, Journal of Computer Technology and Development, 21, 10, pp. 81-84, (2011)
[10]  
Mingyang S., Gwojong Y., Lin C., A real-time network intrusion detection system for large-scale attacks based on an incremental mining approach, International Journal of Computers & Security, 28, 5, pp. 301-309, (2009)