Mining Time-constrained Sequential Patterns with Constraint Programming

被引:0
|
作者
John O. R. Aoga
Tias Guns
Pierre Schaus
机构
[1] Université catholique de Louvain (UCLouvain),Institute of Information and Communication Technologies, Electronics and Applied Mathematics (ICTEAM)
[2] Université d’Abomey-Calavi (UAC),Ecole Doctorale Science de l’Ingénieur (ED
[3] Vrije Universiteit Brussel (VUB),SDI)
[4] Katholieke Universiteit Leuven,undefined
来源
Constraints | 2017年 / 22卷
关键词
Data mining; Sequential pattern mining; Constraint programming; Global constraint; Gap constraint; Span constraint; Time constraint;
D O I
暂无
中图分类号
学科分类号
摘要
Constraint Programming (CP) has proven to be an effective platform for constraint based sequence mining. Previous work has focused on standard frequent sequence mining, as well as frequent sequence mining with a maximum ’gap’ between two matching events in a sequence. The main challenge in the latter is that this constraint can not be imposed independently of the omnipresent frequency constraint. Indeed, the gap constraint changes whether a subsequence is included in a sequence, and hence its frequency. In this work, we go beyond that and investigate the integration of timed events and constraining the minimum/maximum gap as well as minimum/maximum span. The latter constrains the allowed time between the first and last matching event of a pattern. We show how the three are interrelated, and what the required changes to the frequency constraint are. Key in our approach is the concept of an extension window defined by gap/span and we develop techniques to avoid scanning the sequences needlessly, as well as using a backtracking-aware data structure. Experiments demonstrate that the proposed approach outperforms both specialized and CP-based approaches in almost all cases and that the advantage increases as the minimum frequency threshold decreases. This paper is an extension of the original manuscript presented at CPAIOR’17 [5].
引用
收藏
页码:548 / 570
页数:22
相关论文
共 50 条
  • [41] Mining weighted sequential patterns in incremental uncertain databases
    Roy, Kashob Kumar
    Moon, Md Hasibul Haque
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    INFORMATION SCIENCES, 2022, 582 : 865 - 896
  • [42] Mining sequential support affinity patterns with weight constraints
    Yun, Unil
    Distributed Computing and Internet Technology, Proceedings, 2006, 4317 : 414 - 423
  • [43] Time-constrained cost-sensitive decision tree induction
    Chen, Yen-Liang
    Wu, Chia-Chi
    Tang, Kwei
    INFORMATION SCIENCES, 2016, 354 : 140 - 152
  • [44] Constraint Programming approach to time-window and multiresource-constrained projects portfolio prototyping
    Bach, Irena
    Bocewicz, Grzegorz
    Banaszak, Zbigniew
    NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 767 - 776
  • [45] Schedulability Analysis for Noncyclic Operation of Time-Constrained Cluster Tools With Time Variation
    Kim, Hyun-Jung
    Lee, Jun-Ho
    Lee, Tae-Eog
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (03) : 1409 - 1414
  • [46] Mining Sequential Patterns in Data Stream
    Huang, Qinhua
    Ouyang, Weimin
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 865 - 874
  • [47] A constraint programming approach for quantitative frequent pattern mining
    Laghzaoui, Mohammed El Amine
    Lebbah, Yahia
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2023, 15 (03) : 297 - 311
  • [48] Constraint Programming for Itemset Mining with Multiple Minimum Supports
    Belaid, Mohamed-Bachir
    Lazaar, Nadjib
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 598 - 603
  • [49] Targeted mining of contiguous sequential patterns
    Hu, Kaixia
    Gan, Wensheng
    Huang, Shan
    Peng, Hao
    Fournier-Viger, Philippe
    INFORMATION SCIENCES, 2024, 653
  • [50] An algorithm for mining generalized sequential patterns
    Ren, JD
    Cheng, YB
    Yang, LL
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1288 - 1292