Spatio-Temporal Frequent Itemset Mining on Web Data

被引:6
|
作者
Aggarwal, Apeksha [1 ]
Toshniwal, Durga [1 ]
机构
[1] Indian Inst Technol Roorkee, Dept CSE, Roorkee, Uttar Pradesh, India
来源
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW) | 2018年
关键词
Spatio-temporal; frequent pattern; association rule; time; location; ASSOCIATION RULES;
D O I
10.1109/ICDMW.2018.00166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web generates enormous volumes of spatiotemporal data every second. Such data includes transactional data on which association rule mining can he perliamed. Applications includes fraud detection, consumer purchase pattern identification, recommendation systems etc. Essence of spatiotemporal information alongwith the transactional data comes from the fact that the association rules or frequent patterns in the transactions are highly determined by the location and time of the occurrence of that transaction. For example, customer purchase of product depends upon the season and location of buying that product. To extract frequent patterns from such large databases, most existing algorithms demands enormous amounts of resources. The present work proposes a spatiotemporal association rule mining algorithm using hashing, to facilitate reduced memory access time and storage space. Hash based search technique is used to fasten the memory access by directly accessing the required spatio-temporal information from the schema. There are a numerous hash based search techniques that can be used. But to reduce collision, direct address hashing is focused upon primarily in this work. However, in future we plan to extend our results over different search techniques. Our results are compared with exiting Spatio-Temporal Apriori algorithm, which is one of the established association rule mining algorithm. Furthermore, experiments are demonstrated on several synthetically generated and web based datasets. Subsequently, a comparison over different datasets is given. Our algorithm shows improved results when evaluated over several metrics such as support of frequent itemsets and percentage gain in reduced memory access time. In future we plan to extend this work to various benchmark datasets.
引用
收藏
页码:1160 / 1165
页数:6
相关论文
共 50 条
  • [1] Mining of Cascading Spatio-Temporal Frequent Patterns from Massive Data Sets
    Vasavi, M.
    Murugan, A.
    Sharma, K. Venkatesh
    IMPENDING INQUISITIONS IN HUMANITIES AND SCIENCES, ICIIHS-2022, 2024, : 334 - 343
  • [2] A data mining proxy approach for efficient frequent itemset mining
    Jeffrey Xu Yu
    Zhiheng Li
    Guimei Liu
    The VLDB Journal, 2008, 17 : 947 - 970
  • [3] A data mining proxy approach for efficient frequent itemset mining
    Yu, Jeffrey Xu
    Li, Zhiheng
    Liu, Guimei
    VLDB JOURNAL, 2008, 17 (04): : 947 - 970
  • [4] Parallel Frequent Itemset Mining on Streaming Data
    He, Yanshan
    Yue, Min
    2014 10TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2014, : 725 - 730
  • [5] Spatio-temporal data mining in ecological and veterinary epidemiology
    Moustakas, Aristides
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2017, 31 (04) : 829 - 834
  • [6] A Web Interface for Exploiting Spatio-Temporal Heterogeneous Data
    Tran, Ba-Huy
    Plumejeaud-Perreau, Christine
    Bouju, Alain
    WEB AND WIRELESS GEOGRAPHICAL INFORMATION SYSTEMS, W2GIS 2018, 2018, 10819 : 118 - 129
  • [7] A primer to frequent itemset mining for bioinformatics
    Naulaerts, Stefan
    Meysman, Pieter
    Bittremieux, Wout
    Trung Nghia Vu
    Vanden Berghe, Wim
    Goethals, Bart
    Laukens, Kris
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (02) : 216 - 231
  • [8] Parallel Incremental Frequent Itemset Mining for Large Data
    Song, Yu-Geng
    Cui, Hui-Min
    Feng, Xiao-Bing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (02) : 368 - 385
  • [9] Spatio-Temporal Associative Mining for Earthquake Data Distribution in Indonesia
    Edelani, Renovita
    Barakbah, Ali Ridho
    Harsono, Tri
    EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2019, 7 (02) : 586 - 606
  • [10] Inverted Index Automata Frequent Itemset Mining for Large Dataset Frequent Itemset Mining
    Dai, Xin
    Hamed, Haza Nuzly Abdull
    Su, Qichen
    Hao, Xue
    IEEE ACCESS, 2024, 12 : 195111 - 195130