Mining Integrated Sequential Patterns From Multiple Databases

被引:0
|
作者
Ezeife, Christie, I [1 ]
Aravindan, Vignesh [2 ]
Chaturvedi, Ritu [3 ]
机构
[1] Univ Windsor, Comp Sci, Windsor, ON, Canada
[2] Royal Bank Canada, Toronto, ON, Canada
[3] Univ Guelph, Sch Comp Sci, Guelph, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Candidate Generation; Complex Queries; Foreign key; Frequent Itemsets; Frequent Patterns; Frequent Sequences; Multiple Databases; Sequence Database; Transaction Ids; FREQUENT PATTERNS;
D O I
10.4018/IJDWM.2020010101
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing work on multiple databases (MDBs) sequential pattern mining cannot mine frequent sequences to answer exact and historical queries from MDBs having different table structures. This article proposes the transaction id frequent sequence pattern (TidFSeq) algorithm to handle the difficult problem of mining frequent sequences from diverse MDBs. The TidFSeq algorithm transforms candidate 1-sequences to get transaction subsequences where candidate 1-sequences occurred as (1-sequence, itssubsequenceidlist) tuple or (1-sequence, position id list). Subsequent frequent i-sequences are computed using the counts of the sequence ids in each candidate i-sequence position id list tuples. An extended version of the general sequential pattern (GSP)-like candidate generates and a frequency count approach is used for computing supports of itemset (I-step) and separate (S-step) sequences without repeated database scans but with transaction ids. Generated patterns answer complex queries from MDBs. The TidFSeq algorithm has a faster processing time than existing algorithms.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 50 条
  • [1] Mining integrated sequential patterns from multiple databases
    Ezeife, Christie I.
    Aravindan, Vignesh
    Chaturvedi, Ritu
    International Journal of Data Warehousing and Mining, 2020, 16 (01): : 1 - 21
  • [2] Mining sequential patterns across multiple sequence databases
    Peng, Wen-Chih
    Liao, Zhung-Xun
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (10) : 1014 - 1033
  • [3] Mining sequential patterns from probabilistic databases
    Muzammal, Muhammad
    Raman, Rajeev
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (02) : 325 - 358
  • [4] Mining sequential patterns from probabilistic databases
    Muhammad Muzammal
    Rajeev Raman
    Knowledge and Information Systems, 2015, 44 : 325 - 358
  • [5] Mining Sequential Patterns from Probabilistic Databases
    Muzammal, Muhammad
    Raman, Rajeev
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 210 - 221
  • [6] A fast algorithm for mining sequential patterns from large databases
    Chen, N
    Chen, A
    Zhou, LX
    Liu, L
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (04) : 359 - 370
  • [7] A Fast Algorithm for Mining Sequential Patterns from Large Databases
    陈宁
    陈安
    周龙骧
    刘鲁
    Journal of Computer Science and Technology, 2001, (04) : 359 - 370
  • [8] A fast algorithm for mining sequential patterns from large databases
    Chen, N.
    Chen, A.
    Zhou, L.
    Liu, L.
    2001, Allerton Press Inc. (16)
  • [9] A fast algorithm for mining sequential patterns from large databases
    Ning Chen
    An Chen
    Longxiang Zhou
    Lu Liu
    Journal of Computer Science and Technology, 2001, 16 : 359 - 370
  • [10] Mining Closed Sequential Patterns in Progressive Databases
    Subramanyam, R. B. V.
    Rao, A. Suresh
    Karnati, Ramesh
    Suvvari, Somaraju
    Somayajulu, D. V. L. N.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2013, 12 (03)