Efficient Approach for Damped Window-Based High Utility Pattern Mining With List Structure

被引:32
作者
Nam, Hyoju [1 ]
Yun, Unil [1 ]
Vo, Bay [2 ]
Tin Truong [3 ]
Deng, Zhi-Hong [4 ]
Yoon, Eunchul [5 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[3] Ton Duc Thang Univ, Inst Computat Sci, Fac Math & Stat, Div Computat Math & Engn, Ho Chi Minh City 758307, Vietnam
[4] Peking Univ, Dept Machine Intelligence, Beijing 100871, Peoples R China
[5] Konkuk Univ, Dept Elect Engn, Seoul 05029, South Korea
基金
新加坡国家研究基金会;
关键词
Databases; Data mining; Data structures; Data models; Microsoft Windows; Heuristic algorithms; damped window model; pattern pruning; high utility patterns; stream data mining; FREQUENT ITEMSETS; ALGORITHMS;
D O I
10.1109/ACCESS.2020.2979289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional pattern mining is designed to handle binary database that assume all items in the database have same importance, there is a limitation to recognize accurate information from real-world databases using traditional method. To solve this problem, the high utility pattern mining approaches from non-binary database have been proposed and actively studied by many researchers. Lately, new data is progressively created with the passage of time in diverse area such as biometric data of a patient diagnosed in a medical device and log data of an internet user, and the volume of a database is gradually increasing. A database with these characteristics is called a dynamic database. Under these circumstances, high utility mining techniques suitable for analyzing dynamic databases have recently been extensively studied. In this paper, we propose a new list-based algorithm that mines high utility patterns considering the arrival time of each transaction in an incremental database environment. That is, our algorithm efficiently performs pattern pruning by using a damped window model that considers the importance of the previously inputted data lower than that of recently inserted data and identifies high utility patterns. Experimental results indicate that our proposed method has better performance than the state-of-the-art techniques in terms of runtime, memory, and scalability.
引用
收藏
页码:50958 / 50968
页数:11
相关论文
共 30 条
[1]   Frequent Pattern Mining on Time and Location Aware Air Quality Data [J].
Aggarwa, Apeksha ;
Toshniwal, Durga .
IEEE ACCESS, 2019, 7 :98921-98933
[2]   Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Lee, Young-Koo .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) :1708-1721
[3]   Mining Recent Maximal Frequent Itemsets Over Data Streams with Sliding Window [J].
Cai, Saihua ;
Hao, Shangbo ;
Sun, Ruizhi ;
Wu, Gang .
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (06) :961-969
[4]  
Fournier-Viger P., 2015, P ASE BIGDATA SOC IN
[5]   A Survey of Utility-Oriented Pattern Mining [J].
Gan, Wensheng ;
Lin, Jerry Chun-Wei ;
Fournier-Viger, Philippe ;
Chao, Han-Chieh ;
Tseng, Vincent S. ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) :1306-1327
[6]   A Survey of Parallel Sequential Pattern Mining [J].
Gan, Wensheng ;
Lin, Jerry Chun-Wei ;
Fournier-Viger, Philippe ;
Chao, Han-Chieh ;
Yu, Philip S. .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (03)
[7]   HUOPM: High-Utility Occupancy Pattern Mining [J].
Gan, Wensheng ;
Lin, Jerry Chun-Wei ;
Fournier-Viger, Philippe ;
Chao, Han-Chieh ;
Yu, Philip S. .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (03) :1195-1208
[8]   A survey of incremental high-utility itemset mining [J].
Gan, Wensheng ;
Lin, Jerry Chun-Wei ;
Fournier-Viger, Philippe ;
Chao, Han-Chieh ;
Hong, Tzung-Pei ;
Fujita, Hamido .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
[9]   Mining high utility itemsets based on the time decaying model [J].
Kim, Donggyu ;
Yun, Unil .
INTELLIGENT DATA ANALYSIS, 2016, 20 (05) :1157-1180
[10]   Pruning strategies for mining high utility itemsets [J].
Krishnamoorthy, Srikumar .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (05) :2371-2381