Optimized cardinality-based generalized itemset mining using transaction ID and numeric encoding

被引:0
|
作者
Bac Le
Phuc Luong
机构
[1] VNU HCMC,Faculty of Information Technology, Department of Computer Science
[2] University of Science,undefined
来源
Applied Intelligence | 2018年 / 48卷
关键词
Generalized itemset; Cardinality constraints; Optimization; Closed itemset; Maximal itemset;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, generalization-based data mining techniques have become an interesting topic for many data scientists. Generalized itemset mining is an exploration technique that focuses on extracting high-level abstractions and correlations in a database. However, the problem that domain experts must always deal with is how to manage and interpret a large number of extracted patterns from a massive database of transactions. In generalized pattern mining, taxonomies that contain abstraction information for each dataset are defined, so the number of frequent patterns can grow enormously. Therefore, exploiting knowledge turns into a difficult and costly process. In this article, we introduce an approach that uses cardinality-based constraints with transaction id and numeric encoding to mine generalized patterns. We applied transaction id to support the computation of each frequent itemset as well as to encode taxonomies into a numeric type using two simple rules. We also attempted to apply the combination of cardinality cons- traints and closed or maximal patterns. Experiments show that our optimizations significantly improve the performance of the original method, and the importance of comprehensive information within closed and maximal patterns is worth considering in generalized frequent pattern mining.
引用
收藏
页码:2067 / 2080
页数:13
相关论文
共 50 条
  • [1] Optimized cardinality-based generalized itemset mining using transaction ID and numeric encoding
    Le, Bac
    Luong, Phuc
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2067 - 2080
  • [2] Itemset generalization with cardinality-based constraints
    Cagliero, Luca
    Garza, Paolo
    INFORMATION SCIENCES, 2013, 244 : 161 - 174
  • [3] On the evaluation of cardinality-based generalized yes/no queries
    Bosc, Patrick
    Lietard, Nadia
    Pivert, Olivier
    2006 3RD INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 198 - 203
  • [4] On the Evaluation of Cardinality-Based Generalized Yes/No Queries
    Bosc, Patrick
    Ibenhssaien, Nadia
    Pivert, Olivier
    INTELLIGENT TECHNIQUES AND TOOLS FOR NOVEL SYSTEM ARCHITECTURES, 2008, 109 : 65 - 79
  • [5] Frequent Itemset Mining with Differential Privacy Based on Transaction Truncation
    Xia, Ying
    Huang, Yu
    Zhang, Xu
    Bae, HaeYoung
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2017, 2018, 10631 : 438 - 445
  • [6] Architectural evolution of FamiWare using cardinality-based feature models
    Gamez, Nadia
    Fuentes, Lidia
    INFORMATION AND SOFTWARE TECHNOLOGY, 2013, 55 (03) : 563 - 580
  • [7] Configuration of Cardinality-based Feature Models using Generative Constraint Satisfaction
    Dhungana, Deepak
    Falkner, Andreas
    Haselboeck, Alois
    2011 37TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2011), 2011, : 100 - 103
  • [8] A frequent itemset generation approach in data mining using transaction-labelling dynamic itemset counting method
    Balaram, Ambily
    Raju, Nedunchezhian
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2025, 17 (01)
  • [9] Towards a Compact SAT-Based Encoding of Itemset Mining Tasks
    Nekkache, Ikram
    Jabbour, Said
    Sais, Lakhdar
    Kamel, Nadjet
    INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, 2021, 12735 : 163 - 178
  • [10] Fuzzy based optimized itemset mining in high dimensional transactional database using adaptable FCM
    Saravanabhavan, C.
    Kirubakaran, S.
    Premkumar, R.
    Joyce, V. Jemmy
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (04) : 6957 - 6971