An Efficient Parallel Method for Mining Frequent Closed Sequential Patterns

被引:17
作者
Bao Huynh [1 ,2 ,3 ]
Bay Vo [4 ,5 ]
Snasel, Vaclav [3 ]
机构
[1] Ton Duc Thang Univ, Ctr Appl Informat Technol, Ho Chi Minh City 700000, Vietnam
[2] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[3] Tech Univ Ostrava, VSB, Fac Elect Engn & Comp Sci, Ostrava 70800, Czech Republic
[4] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[5] Sejong Univ, Coll Elect & Informat Engn, Seoul 143747, South Korea
关键词
Data mining; dynamic bit vectors; dynamic load balancing; multi-core processors; closed sequential patterns;
D O I
10.1109/ACCESS.2017.2739749
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining frequent closed sequential pattern (FCSPs) has attracted a great deal of research attention, because it is an important task in sequences mining. In recently, many studies have focused on mining frequent closed sequential patterns because, such patterns have proved to be more efficient and compact than frequent sequential patterns. Information can be fully extracted from frequent closed sequential patterns. In this paper, we propose an efficient parallel approach called parallel dynamic bit vector frequent closed sequential patterns (pDBV-FCSP) using multi-core processor architecture for mining FCSPs from large databases. The pDBV-FCSP divides the search space to reduce the required storage space and performs closure checking of prefix sequences early to reduce execution time for mining frequent closed sequential patterns. This approach overcomes the problems of parallel mining such as overhead of communication, synchronization, and data replication. It also solves the load balance issues of the workload between the processors with a dynamic mechanism that re-distributes the work, when some processes are out of work to minimize the idle CPU time.
引用
收藏
页码:17392 / 17402
页数:11
相关论文
共 43 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]  
[Anonymous], 2014, VIETNAM J COMPUT SCI, DOI DOI 10.1007/S40595-013-0012-3
[3]  
[Anonymous], [No title captured]
[4]  
[Anonymous], INDIAN J COMPUT SCI
[5]  
Ayres J., 2002, SIGKDD 02, V2, P1
[6]  
Bao Huynh, 2016, ICIC Express Letters, V10, P2105
[7]  
Bao Huynh, 2015, ICIC Express Letters, V9, P3071
[8]   DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets [J].
Bay Vo ;
Hong, Tzung-Pei ;
Bac Le .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) :7196-7206
[9]  
Chai L, 2007, CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, P471
[10]  
Cong S., 2005, P 11 ACM SIGKDD INT, P562