A Schema Feature Based Frequent Pattern Mining Algorithm for Semi-structured Data Stream

Cited by: 0
Authors
Fu, Weiqi [1]
Liao, Husheng [1]
Jin, Xueyun [1]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY (FMSMT 2017) | 2017 / Vol. 130
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
frequent pattern mining; semi-structured data stream; schema feature
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Data mining finds useful information in massive data, and frequent pattern mining is one of its important tasks. Recent research on frequent pattern mining for semi-structured data has made progress, and frequent pattern mining for data streams has also received considerable attention. However, only a few studies address both semi-structured data and data streams. This paper proposes an algorithm named SPrefixTreeISpan. We first segment the semi-structured data stream and then use the pattern-growth method to mine each segment. Finally, we maintain all results in a structure called patternTree. The mining algorithm is further optimized using the inevitable parent-child relationship and the inevitable child-parent relationship extracted from the XML schema. Experiments show that SPrefixTreeISpan has better performance.
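The segment-then-merge idea in the abstract can be illustrated with a small sketch. This is not SPrefixTreeISpan: it assumes the stream is an iterable of small XML documents, it replaces pattern-growth mining with simple counting of parent-child tag pairs, and it uses a plain `Counter` as a stand-in for the patternTree. The function names (`segments`, `edge_patterns`, `mine_stream`) and the parameters `segment_size` and `min_support` are hypothetical, and the schema-based optimization is not modeled.

```python
from collections import Counter
from itertools import islice
import xml.etree.ElementTree as ET


def segments(stream, size):
    """Yield consecutive lists of at most `size` items from an iterable stream."""
    it = iter(stream)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def edge_patterns(xml_text):
    """Return the set of (parent_tag, child_tag) pairs found in one document."""
    root = ET.fromstring(xml_text)
    # Iterating over an Element yields its direct children.
    return {(parent.tag, child.tag) for parent in root.iter() for child in parent}


def mine_stream(stream, segment_size, min_support):
    """Count edge patterns segment by segment and keep the frequent ones.

    `min_support` is a fraction of all documents seen so far (an assumption;
    the abstract does not state how support is defined).
    """
    totals = Counter()  # hypothetical stand-in for the paper's patternTree
    seen = 0
    for batch in segments(stream, segment_size):
        local = Counter()
        for doc in batch:
            # Each pattern is counted at most once per document.
            local.update(edge_patterns(doc))
        totals.update(local)  # merge this segment's counts into the global result
        seen += len(batch)
    threshold = min_support * seen
    return {p: c for p, c in totals.items() if c >= threshold}


if __name__ == "__main__":
    docs = ["<a><b/><c/></a>", "<a><b/></a>", "<a><b><c/></b></a>", "<a><c/></a>"]
    print(mine_stream(docs, segment_size=2, min_support=0.5))
```

Processing one segment at a time keeps only a bounded batch of documents in memory, which is the point of segmenting a stream; only the merged counts persist across segments.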
Pages: 1329-1336
Page count: 8