Memory-Efficient Sequential Pattern Mining with Hybrid Tries

被引:0
作者
Hosseininasab, Amin [1 ]
van Hoeve, Willem-Jan [2 ]
Cire, Andre A. [3 ]
机构
[1] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA USA
[3] Univ Toronto, Rotman Sch Management, Toronto, ON, Canada
关键词
Sequential pattern mining; Memory efficiency; Large-scale pattern mining; Trie data set models; GENERATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a memory-efficient approach for Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck for large data sets. Our methodology involves a novel hybrid trie data structure that exploits recurring patterns to compactly store the data set in memory; and a corresponding mining algorithm designed to effectively extract patterns from this compact representation. Numerical results on small to medium-sized real-life test instances show an average improvement of 85% in memory consumption and 49% in computation time compared to the state of the art. For large data sets, our algorithm stands out as the only capable SPM approach within 256GB of system memory, potentially saving 1.7TB in memory consumption.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] CLOSED SEQUENTIAL PATTERN MINING IN BIOLOGICAL DATA
    Jawahar, S.
    Harishchander, A.
    Devaraju, S.
    Ali, S. Ahamed Johnsha
    Manivasagan, C.
    Sumathi, P.
    INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2020, : 9 - 13
  • [32] Sequential pattern mining in databases with temporal uncertainty
    Ge, Jiaqi
    Xia, Yuni
    Wang, Jian
    Nadungodage, Chandima Hewa
    Prabhakar, Sunil
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 821 - 850
  • [33] Generalized Sequential Pattern Mining with Item Intervals
    Hirate, Yu
    Yamana, Hayato
    JOURNAL OF COMPUTERS, 2006, 1 (03) : 51 - 60
  • [34] NetNMSP: Nonoverlapping maximal sequential pattern mining
    Yan Li
    Shuai Zhang
    Lei Guo
    Jing Liu
    Youxi Wu
    Xindong Wu
    Applied Intelligence, 2022, 52 : 9861 - 9884
  • [35] Detecting and exploiting symmetries in sequential pattern mining
    Nekkache, Ikram
    Jabbour, Said
    Kamel, Nadjet
    Sais, Lakhdar
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (04) : 309 - 334
  • [36] Sequential pattern mining in databases with temporal uncertainty
    Jiaqi Ge
    Yuni Xia
    Jian Wang
    Chandima Hewa Nadungodage
    Sunil Prabhakar
    Knowledge and Information Systems, 2017, 51 : 821 - 850
  • [37] Sequential Pattern Mining with the Micron Automata Processor
    Wang, Ke
    Sadredini, Elaheh
    Skadron, Kevin
    PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 135 - 144
  • [38] The evaluation of occupational accident with sequential pattern mining
    Mutlu, Nazli Gulum
    Altuntas, Serkan
    Dereli, Turkay
    SAFETY SCIENCE, 2023, 166
  • [39] Closed sequential pattern mining for sitemap generation
    Ceci, Michelangelo
    Lanotte, Pasqua Fabiana
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (01): : 175 - 203
  • [40] NetNCSP: Nonoverlapping closed sequential pattern mining
    Wu, Youxi
    Zhu, Changrui
    Li, Yan
    Guo, Lei
    Wu, Xindong
    KNOWLEDGE-BASED SYSTEMS, 2020, 196 (196)