A performance evaluation of data streams sampling algorithms over a sliding window

被引:0
作者
El Sibai, Rayane [1 ]
Chabchoub, Yousra [1 ]
Demerjian, Jacques [2 ]
Chiky, Raja [1 ]
Barbar, Kablan [2 ]
机构
[1] ISEP, LlSITE Lab, F-92130 Issy Les Moulineaux, France
[2] Lebanese Univ, Fac Sci, LARIFA EDST Lab, Fanar, Lebanon
来源
2018 IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM) | 2018年
关键词
Data streams; sampling algorithms; Chain sampling; Simple Random sampling; Deterministic sampling; accuracy;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Over the last few years, a large number of applications have appeared generating data streams for which the exhaustive storage require high costs. It was, therefore, necessary to process these data in real-time without storing them. Real-time processing requires specifying the analysis tasks before the arrival of the data. Consequently, any future need will not be answered because all the received data are lost. That is why it is necessary to keep a data stream summary in order to provide an approximate answer to the future queries. This can be done using the sampling algorithms. This paper is a continuation of our previous work in which we studied the Chain sampling algorithm. In this paper, we discuss two other sampling techniques: Deterministic sampling and Simple Random sampling (SRS) and we compare their performance against that of Chain sampling. The results show that Chain sampling gives better results than SRS and Deterministic sampling in terms of execution time and sampling accuracy respectively.
引用
收藏
页码:211 / 216
页数:6
相关论文
共 13 条
[1]  
Aggarwal C.C., 2007, Data streams: models and algorithms, P169, DOI DOI 10.1007/978-0-387-47534-9_9
[2]  
Babcock B, 2002, SIAM PROC S, P633
[3]  
Chabchoub Y., 2010, Proceedings 2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010), P1297, DOI 10.1109/ICDMW.2010.18
[4]  
Chabchoub Yousra, 2012, P 7 INT C RISK SEC I, P1
[5]  
Chiky Raja, 2008, METHODES DE SONDAGES, P314
[6]  
El Sibai R, 2016, 2016 INTERNATIONAL CONFERENCE ON DIGITAL ECONOMY (ICDEC), P29, DOI 10.1109/ICDEC.2016.7563142
[7]  
El Sibai R, 2015, 2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), P487, DOI 10.1109/IntelCIS.2015.7397265
[8]  
Golab L, 2003, SIGMOD REC, V32, P5, DOI 10.1145/776985.776986
[9]  
Golab L., 2003, TECHNICAL REPORT
[10]  
Gravetter FJ., 2018, RES METHODS BEHAV SC