Accelerated Frequent Closed Sequential Pattern Mining for uncertain data

被引:7
|
作者
You, Tao [1 ]
Sun, Yue [1 ]
Zhang, Ying [1 ]
Chen, Jinchao [1 ]
Zhang, Peng [1 ]
Yang, Mei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 610072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Uncertain database; Frequent closed sequences; Possible world semantics; SEQUENCES; ALGORITHM; ITEMSETS;
D O I
10.1016/j.eswa.2022.117254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data uncertainty has been taken into a consideration for mining and discovery of its hidden knowledge in a variety of applications. Due to the fact that the nature of closed sequences is closely related to possible world, more recent studies on uncertain closed sequential data mining has usually been challenged by the explosive possible worlds, which is exponential to the number of uncertain sequences in the database. Although basic Probabilistic Frequent Closed Sequences Mining (PFCSM-FF) strategy can solve this problem preliminarily, the inclusion-exclusion rules and closure checking methods used in PFCSM-FF makes mining algorithm very inefficient. And on this basis, another two improvements, PFCSM-CF and PFCSM-CC algorithms, are designed to reduce the search space and simplify the candidate sequence database, which significantly compress the computational scale. Substantial experiments on the real and synthetic datasets have demonstrated the efficiency improvement on the proposed PFCSM-CC and PFCSM-CF methods. Besides, the high usability of the proposed PFCSM-CC algorithm is further demonstrated according to the similarity of the time spent on existing probabilistic frequent sequence mining algorithm.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A Comparative Study of Different Frequent Pattern Mining Algorithm For Uncertain Data: A survey
    Goyal, Neha
    Jain, S. K.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 183 - 187
  • [22] Frequent pattern mining algorithm for uncertain data streams based on sliding window
    Yang, Junrui
    Yang, Cai
    Wei, Yanjun
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 265 - 268
  • [23] On Probabilistic Models for Uncertain Sequential Pattern Mining
    Muzammal, Muhammad
    Raman, Rajeev
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 60 - 72
  • [24] UP-EVOLVE - UNCERTAIN FREQUENT PATTERN MINING
    Wang, Shu
    Ng, Vincent
    ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2011, : 74 - 84
  • [25] UF-Evolve: Uncertain Frequent Pattern Mining
    Wang, Shu
    Vincent Ng
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2011, 2012, 102 : 98 - 116
  • [26] Mining frequent closed itemsets using conditional frequent pattern tree
    Singh, SR
    Patra, BK
    Giri, D
    Proceedings of the IEEE INDICON 2004, 2004, : 501 - 504
  • [27] Mining frequent itemsets from uncertain data
    Chui, Chun-Kit
    Kao, Ben
    Hung, Edward
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 47 - +
  • [28] Mining maximal frequent itemsets in uncertain data
    Tang, Xianghong
    Yang, Quanwei
    Zheng, Yang
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2015, 43 (09): : 29 - 34
  • [29] Frequent Itemsets Mining on Weighted Uncertain Data
    Alharbi, Manal
    Pathak, Sudipta
    Rajasekaran, Sanguthevar
    2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2014, : 201 - 206
  • [30] Sequential pattern mining applied to aeroengine condition monitoring with uncertain health data
    Palacios, Ana
    Martinez, Alvaro
    Sanchez, Luciano
    Couso, Ines
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 44 : 10 - 24