Memory-Efficient Sequential Pattern Mining with Hybrid Tries

Citations: 0
Authors
Hosseininasab, Amin [1]
van Hoeve, Willem-Jan [2]
Cire, Andre A. [3]
Affiliations
[1] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA USA
[3] Univ Toronto, Rotman Sch Management, Toronto, ON, Canada
Keywords
Sequential pattern mining; Memory efficiency; Large-scale pattern mining; Trie data set models; GENERATION;
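DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract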
This paper develops a memory-efficient approach for Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck for large data sets. Our methodology involves a novel hybrid trie data structure that exploits recurring patterns to compactly store the data set in memory, and a corresponding mining algorithm designed to effectively extract patterns from this compact representation. Numerical results on small to medium-sized real-life test instances show an average improvement of 85% in memory consumption and 49% in computation time compared to the state of the art. For large data sets, our algorithm stands out as the only capable SPM approach within 256 GB of system memory, potentially saving 1.7 TB in memory consumption.
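The abstract does not describe the hybrid trie itself, so the following is only a minimal sketch of the general idea it builds on: storing a sequence data set in a plain prefix trie so that recurring prefixes are kept once. It is not the authors' data structure or mining algorithm. The names `TrieNode`, `build_trie`, `count_nodes`, and `prefix_support`, and the toy `database`, are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    """One node per distinct prefix; count = number of sequences passing through it."""
    children: dict = field(default_factory=dict)
    count: int = 0


def build_trie(sequences):
    """Insert every sequence into a prefix trie so common prefixes are shared."""
    root = TrieNode()
    for seq in sequences:
        node = root
        for item in seq:
            node = node.children.setdefault(item, TrieNode())
            node.count += 1
    return root


def count_nodes(node):
    """Number of item-bearing nodes in the trie (the root is excluded)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


def prefix_support(root, prefix):
    """Number of sequences that start with `prefix` (not full subsequence support)."""
    node = root
    for item in prefix:
        node = node.children.get(item)
        if node is None:
            return 0
    return node.count


# Toy data set (an assumption for illustration, not from the paper).
database = [
    ["a", "b", "c"],
    ["a", "b", "d"],
    ["a", "c"],
    ["b", "c"],
]
trie = build_trie(database)
flat_items = sum(len(s) for s in database)  # items stored without any sharing
trie_items = count_nodes(trie)              # items stored with prefix sharing
print(flat_items, trie_items, prefix_support(trie, ["a", "b"]))
```

On this toy data set the trie holds fewer item nodes than the flat listing because the prefixes `a` and `a, b` are stored once. Note that the counts kept here are prefix counts only; a real SPM algorithm needs the support of arbitrary subsequences, which requires additional machinery that this sketch does not attempt.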
Pages: 29