Mining high average utility itemsets using artificial fish swarm algorithm with computed multiple minimum average utility thresholds

被引:1
作者
Nandhini, S. S. [1 ]
Kannimuthu, S. [2 ]
机构
[1] Bannari Amman Inst Technol, Sathyamangalam, Tamil Nadu, India
[2] Karpagam Coll Engn, Coimbatore, Tamil Nadu, India
关键词
Artificial fish swarm algorithm; data mining; frequent itemset mining; high average utility itemsets; itemset mining; utility mining;
D O I
10.3233/JIFS-231852
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is obvious that the problem of Frequent Itemset Mining (FIM) is very popular in data mining, which generates frequent itemsets from a transaction database. An extension of the frequent itemset mining is High Utility Itemset Mining (HUIM) which identifies itemsets with high utility from the transaction database. This gains popularity in data mining, because it identifies itemsets which have more value but the same was not identified as frequent by Frequent Itemset Mining. HUIM is generally referred to as Utility Mining. The utility of the items is measured based on parameters like cost, profit, quantity or any other measures preferred by the users. Compared to high utility itemsets (HUIs) mining, high average utility itemsets (HAUIs) mining is more precise by considering the number of items in the itemsets. In state-of-the-art algorithms that mines HUIS and HAUIs use a single fixed minimum utility threshold based on which HAUIs are identified. In this paper, the proposed algorithm mines HAUIs from transaction databases using Artificial Fish Swarm Algorithm (AFSA) with computed multiple minimum average utility thresholds. Computing the minimum average utility threshold for each item with the AFSA algorithm outperforms other state-of-the-art HAUI mining algorithms with multiple minimum utility thresholds and userdefined single minimum threshold in terms of number of HAUIs. It is observed that the proposed algorithm outperforms well in terms of execution time, number of candidates generated and memory consumption when compared to the state-of-the-art algorithms.
引用
收藏
页码:1597 / 1613
页数:17
相关论文
共 39 条