Accelerated Frequent Closed Sequential Pattern Mining for uncertain data

Cited by: 7
Authors
You, Tao [1]
Sun, Yue [1]
Zhang, Ying [1]
Chen, Jinchao [1]
Zhang, Peng [1]
Yang, Mei [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 610072, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Uncertain database; Frequent closed sequences; Possible world semantics; SEQUENCES; ALGORITHM; ITEMSETS;
DOI
10.1016/j.eswa.2022.117254
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Data uncertainty is taken into account when mining hidden knowledge in a variety of applications. Because the closedness of a sequence is closely tied to the possible world in which it is evaluated, recent studies on mining closed sequences from uncertain data have been challenged by the explosion of possible worlds, whose number is exponential in the number of uncertain sequences in the database. Although the basic Probabilistic Frequent Closed Sequences Mining (PFCSM-FF) strategy offers a preliminary solution to this problem, the inclusion-exclusion rules and closure checking methods it uses make the mining algorithm very inefficient. On this basis, two improved algorithms, PFCSM-CF and PFCSM-CC, are designed to reduce the search space and simplify the candidate sequence database, which significantly reduces the computational scale. Extensive experiments on real and synthetic datasets demonstrate the efficiency gains of the proposed PFCSM-CC and PFCSM-CF methods. In addition, the practicality of PFCSM-CC is further demonstrated by its running time, which is similar to that of an existing probabilistic frequent sequence mining algorithm.
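To make the possible-world blow-up mentioned in the abstract concrete, the following is a minimal Python sketch. It is not the PFCSM-FF, PFCSM-CF, or PFCSM-CC algorithm. It assumes a simple sequence-level uncertainty model in which each sequence exists independently with a given probability, and the function names (`is_subsequence`, `frequent_prob_enumerate`, `frequent_prob_dp`) are hypothetical. The first function enumerates all 2^n possible worlds. The second computes the same frequentness probability with a standard Poisson-binomial dynamic program, which illustrates why avoiding enumeration matters.

```python
from itertools import product


def is_subsequence(pattern, sequence):
    """Return True if pattern's items occur in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def frequent_prob_enumerate(db, pattern, min_sup):
    """Probability that support(pattern) >= min_sup, by enumerating every world.

    db is a list of (sequence, existence_probability) pairs; this data model
    is an assumption made for illustration only.
    """
    total = 0.0
    # Each world fixes, for every sequence, whether it is present or absent.
    for world in product([False, True], repeat=len(db)):
        prob = 1.0
        support = 0
        for present, (seq, p) in zip(world, db):
            prob *= p if present else 1.0 - p
            if present and is_subsequence(pattern, seq):
                support += 1
        if support >= min_sup:
            total += prob
    return total


def frequent_prob_dp(db, pattern, min_sup):
    """Same probability via a Poisson-binomial DP, without enumerating worlds."""
    # dist[k] = probability that exactly k supporting sequences are present.
    dist = [1.0]
    for seq, p in db:
        if not is_subsequence(pattern, seq):
            continue  # non-supporting sequences never change the support
        new = [0.0] * (len(dist) + 1)
        for k, q in enumerate(dist):
            new[k] += q * (1.0 - p)   # sequence absent in this world
            new[k + 1] += q * p       # sequence present in this world
        dist = new
    return sum(dist[min_sup:])
```

Both functions return the same value, but the enumeration visits 2^n worlds for n uncertain sequences, while the dynamic program runs in time quadratic in the number of supporting sequences. Checking closedness under possible-world semantics is harder than this frequentness computation, which is the difficulty the abstract refers to.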
Pages: 12