Mining microscopic and macroscopic changes in network data streams

被引:8
作者
Loglisci, Corrado [1 ,2 ]
Ceci, Michelangelo [1 ,2 ]
Impedovo, Angelo [1 ]
Malerba, Donato [1 ,2 ]
机构
[1] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy
[2] CINI, Bari, Italy
关键词
Data mining; Change detection; Network analysis; Data stream; FREQUENT; ALGORITHMS;
D O I
10.1016/j.knosys.2018.07.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network data streams are unbounded sequences of complex data produced at high rate which represent complex systems that evolve continuously over time. In this scenario, a problem worthy of being studied is the analysis of the changes, which may concern a complex system as a whole or small parts of it. In this paper, these are distinguished into macroscopic changes and microscopic changes: macroscopic changes have impact on a substantial part of the network, whereas microscopic changes concern variations occurring in specific portions of the network. The algorithm we propose, called KARMA, combines the frequent pattern mining framework with an automatic time-window detection approach. In this way, it is able to detect changes on the frequent sub-networks mined from different time-windows: network changes are then represented as variations of structural regularities frequently observed over the stream. KARMA takes an holistic perspective, in which the two kinds of change are related each other. This is the main novelty with respect to the recent studies, which do not simultaneously extract microscopic and macroscopic changes. Experiments on several real-world network data streams show the effectiveness and efficiency of our approach in comparison with competing algorithms and the usefulness of the changes detected.
引用
收藏
页码:294 / 312
页数:19
相关论文
共 50 条
  • [41] Mining maximal frequent patterns by considering weight conditions over data streams
    Yun, Unil
    Lee, Gangin
    Ryu, Keun Ho
    KNOWLEDGE-BASED SYSTEMS, 2014, 55 : 49 - 65
  • [42] An Efficient Algorithm for Mining Frequent Patterns over High Speed Data Streams
    Meng, Cai-xia
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 1, PROCEEDINGS, 2009, : 319 - 323
  • [43] On the design of hardware-software architectures for frequent itemsets mining on data streams
    Bustio-Martinez, Lazaro
    Cumplido, Rene
    Hernandez-Leon, Raudel
    Bande-Serrano, Jose M.
    Feregrino-Uribe, Claudia
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (03) : 415 - 440
  • [44] A Martingale Framework for Detecting Changes in Data Streams by Testing Exchangeability
    Ho, Shen-Shyang
    Wechsler, Harry
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (12) : 2113 - 2127
  • [45] A Randomized Neural Network for Data Streams
    Pratama, Mahardhika
    Angelov, Plamen P.
    Lu, Jie
    Lughofer, Edwin
    Seera, Manjeevan
    Lim, C. P.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3423 - 3430
  • [46] Mining frequent itemsets in data streams using the weighted sliding window model
    Tsai, Pauray S. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) : 11617 - 11625
  • [47] Fast and light boosting for adaptive mining of data streams
    Chu, F
    Zaniolo, C
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 282 - 292
  • [48] Efficient mining of frequent itemsets from data streams
    Leung, Carson Kai-Sang
    Brajczuk, Dale A.
    SHARING DATA, INFORMATION AND KNOWLEDGE, PROCEEDINGS, 2008, 5071 : 2 - 14
  • [49] Toward Mining Capricious Data Streams: A Generative Approach
    He, Yi
    Wu, Baijun
    Wu, Di
    Beyazit, Ege
    Chen, Sheng
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (03) : 1228 - 1240
  • [50] Mining noisy data streams via a discriminative model
    Chu, F
    Wang, YZ
    Zaniolo, C
    DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 47 - 59