What to expect from a set of itemsets?

被引:0
作者
Delacroix, T. [1 ]
Lenca, P. [2 ]
Lallich, S. [3 ]
机构
[1] Univ Paris Saclay, Polytech Paris Saclay, Orsay, France
[2] IMT Atlantique, Lab STICC, F-29238 Brest, France
[3] Univ Lyon 2, Lab ERIC, Lyon, France
关键词
Pattern mining; Itemset mining; Interestingness; Redundancy; Maximum entropy model; Independence model; INTERESTINGNESS MEASURES;
D O I
10.1016/j.ins.2021.12.115
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dealing with redundancy is one of the main challenges in frequency based data mining and itemset mining in particular. To tackle this issue in the most objective possible way, we introduce the theoretical bases of a new probabilistic concept: Mutual constrained independence (MCI). Thanks to this notion, we describe a MCI model for the frequencies of all itemsets which is the least binding in terms of model hypotheses defined by the knowledge of the frequencies of some of the itemsets. We provide a method for computing MCI models based on algebraic geometry. We establish the link between MCI models and a class of MaxEnt models which has already known to be used in pattern mining. As such, our research presents further insight on the nature of such models and an entirely novel approach for computing them. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:314 / 340
页数:27
相关论文
共 49 条
[21]  
Hamalainen W, 2008, UCI Machine Learning Repository
[22]   Frequent pattern mining: current status and future directions [J].
Han, Jiawei ;
Cheng, Hong ;
Xin, Dong ;
Yan, Xifeng .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (01) :55-86
[23]  
Hanhijärvi S, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P379
[24]   Untangling the complexity of multimorbidity with machine learning [J].
Hassaine, Abdelaali ;
Salimi-Khorshidi, Gholamreza ;
Canoy, Dexter ;
Rahimi, Kazem .
MECHANISMS OF AGEING AND DEVELOPMENT, 2020, 190
[25]  
Jaroszewicz S., 2002, Advances in Knowledge Discovery and Data Mining. 6th Pacific-Asia Conference, PAKDD 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2336), P135
[26]  
Jaynes E. T., 2003, PROBABILITY THEORY L
[27]   ON THE RATIONALE OF MAXIMUM-ENTROPY METHODS [J].
JAYNES, ET .
PROCEEDINGS OF THE IEEE, 1982, 70 (09) :939-952
[28]   Defining and measuring multimorbidity: a systematic review of systematic reviews [J].
Johnston, Marjorie C. ;
Crilly, Michael ;
Black, Corri ;
Prescott, Gordon J. ;
Mercer, Stewart W. .
EUROPEAN JOURNAL OF PUBLIC HEALTH, 2019, 29 (01) :182-189
[29]   On interestingness measures of formal concepts [J].
Kuznetsov, S. O. ;
Makhalova, T. .
INFORMATION SCIENCES, 2018, 442 :202-219
[30]  
Le Bras Y, 2012, DATA MINING: FOUNDATIONS AND INTELLIGENT PARADIGMS, VOL 2: STATISTICAL, BAYESIAN, TIMES SERIES AND OTHER THEORETICAL ASPECTS, P77