WSpan: Weighted sequential pattern mining in large sequence databases

被引:0
|
作者
Yun, Unil
Leggett, John J.
机构
关键词
Data Mining; downward closure property; sequential pattern mining; weight constraints;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential pattern mining algorithms have been developed which mine the set of frequent subsequences satisfying a minimum support constraint in a sequence database. However, previous sequential mining algorithms treat sequential patterns uniformly while sequential patterns have different importance. Another main problem in most of the sequence mining algorithms is that they still generate an exponentially large number of sequential patterns when a minimum support is lowered and they do not provide alternative ways to adjust the number of sequential patterns other than increasing the minimum support. In this paper, we propose a Weighted Sequential pattern mining algorithm called WSpan. Our main approach is to push the weight constraints into the sequential pattern growth approach while maintaining the downward closure property. A weight range is defined to maintain the downward closure property and items are given different weights within the weight range. In scanning a sequence database, a maximum weight in the sequence database is used to prune weighted infrequent sequential patterns and in the mining step, maximum weights of projected sequence databases are used. By doing so, the downward closure property can be maintained. WSpan generates fewer but important weighted sequential patterns in large databases, particularly dense databases with a low minimum support, by adjusting a weight range. Introduction
引用
收藏
页码:503 / 508
页数:6
相关论文
共 50 条
  • [1] Mining Weighted a Closed Sequential Patterns in Large Databases
    Ren, Jia-Dong
    Yang, Jing
    Li, Yan
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 640 - 644
  • [2] Distributed Sequential Pattern Mining in Large Scale Uncertain Databases
    Ge, Jiaqi
    Xia, Yuni
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 17 - 29
  • [3] A new framework for detecting weighted sequential patterns in large sequence databases
    Yun, Unil
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (02) : 110 - 122
  • [4] New approach for the sequential pattern mining of high-dimensional sequence databases
    Liu, Hongyan
    Lin, Fangzhou
    He, Jun
    Cai, Yunjue
    DECISION SUPPORT SYSTEMS, 2010, 50 (01) : 270 - 280
  • [5] Distributed Algorithm for Sequential Pattern Mining on a Large Sequence Dataset
    Tho Hoang
    Bac Le
    Minh-Thai Tran
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2017), 2017, : 18 - 23
  • [6] Efficient weighted sequential pattern mining
    Chen, Shaotao
    Chen, Jiahui
    Wan, Shicheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [7] Weighted frequent sequential pattern mining
    Md Ashraful Islam
    Mahfuzur Rahman Rafi
    Al-amin Azad
    Jesan Ahammed Ovi
    Applied Intelligence, 2022, 52 : 254 - 281
  • [8] Weighted frequent sequential pattern mining
    Islam, Md Ashraful
    Rafi, Mahfuzur Rahman
    Azad, Al-amin
    Ovi, Jesan Ahammed
    APPLIED INTELLIGENCE, 2022, 52 (01) : 254 - 281
  • [9] Fast Weighted Sequential Pattern Mining
    Ye, Zhenqiang
    Li, Ziyang
    Guo, Weibin
    Gan, Wensheng
    Wan, Shicheng
    Chen, Jiahui
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 807 - 818
  • [10] Sequential pattern mining in databases with temporal uncertainty
    Ge, Jiaqi
    Xia, Yuni
    Wang, Jian
    Nadungodage, Chandima Hewa
    Prabhakar, Sunil
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 821 - 850