From sequential pattern mining to structured pattern mining: A pattern-growth approach

被引:0
作者
Jia-Wei Han
Jian Pei
Xi-Feng Yan
机构
[1] University of Illinois at Urbana-Champaign,
[2] State University of New York at Buffalo,undefined
来源
Journal of Computer Science and Technology | 2004年 / 19卷
关键词
data mining; sequential pattern mining; structured pattern mining; scalability; performance analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) acandidate generation-and-test approach, represented by (i) GSP, a horizontal format-based sequential pattern mining method, and (ii) SPADE, a vertical format-based method; and (2) apattern-growth method, represented by PrefixSpan and its further extensions, such as gSpan for mining structured patterns.
引用
收藏
页码:257 / 279
页数:22
相关论文
共 12 条
  • [1] Mannila H(1997)Discovery of frequent episodes in event sequences Data Mining and Knowledge Discovery 1 259-289
  • [2] Toivonen H(1998)Mining temporal relationships with multiple granularities in time sequences Data Engineering Bulletin 21 32-38
  • [3] Verkamo A I(2001)SPADE: An efficient algorithm for mining frequent sequences Machine Learning 40 31-60
  • [4] Bettini C(2000)KDD-Cup 2000 organizers' report: Peeling the onion SIGKDD Explorations 2 86-98
  • [5] Wang X S(undefined)undefined undefined undefined undefined-undefined
  • [6] Jajodia S(undefined)undefined undefined undefined undefined-undefined
  • [7] Zaki M(undefined)undefined undefined undefined undefined-undefined
  • [8] Kohavi R(undefined)undefined undefined undefined undefined-undefined
  • [9] Brodley C(undefined)undefined undefined undefined undefined-undefined
  • [10] Frasca B(undefined)undefined undefined undefined undefined-undefined