Spread Estimation With Non-Duplicate Sampling in High-Speed Networks

被引：27

作者：

Huang, He ^{[1
,2
]}

Sun, Yu-E ^{[2
,3
]}

Ma, Chaoyi ^{[2
]}

Chen, Shigang ^{[2
]}

Du, Yang ^{[1
]}

Wang, Haibo ^{[2
]}

Xiao, Qingjun ^{[4
]}

机构：

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[2] Univ Florida, Dept Comp & Informat Sci & Engn, Gainesville, FL 32611 USA

[3] Soochow Univ, Sch Rail Transportat, Suzhou 215131, Peoples R China

[4] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China

来源：

IEEE-ACM TRANSACTIONS ON NETWORKING | 2021年 / 29卷 / 05期

基金：

中国国家自然科学基金;

关键词：

System-on-chip; Estimation; Size measurement; Art; High-speed networks; Probabilistic logic; Throughput; Traffic measurement; sampling; spread estimation; high-speed network; TRAFFIC MEASUREMENT; COUNTER BRAIDS; ALGORITHMS;

D O I：

10.1109/TNET.2021.3078725

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Per-flow spread measurement in high-speed networks has many practical applications. It is a more difficult problem than the traditional per-flow size measurement. Most prior work is based on sketches, focusing on reducing their space requirements in order to fit in on-chip cache memory. This design allows the measurement to be performed at the line rate, but it suffers from expensive computation for spread queries (unsuitable for online operations) and large errors in spread estimation for small flows. This paper complements the prior art with a new spread estimator design based on an on-chip/off-chip model. By storing traffic statistics in off-chip memory, our new design faces a key technical challenge to design an efficient online module of non-duplicate sampling that cuts down the off-chip memory access. We first propose a two-stage solution for non-duplicate sampling, which is efficient but cannot handle well a sampling probability that is either too small or too big. We then address this limitation through a three-stage solution that is more space-efficient. Our analysis shows that the proposed spread estimator is highly configurable for a variety of probabilistic performance guarantees. We implement our spread estimator in hardware using FPGA. The experiment results based on real Internet traffic traces show that our estimator produces spread estimation with much better accuracy than the prior art, reducing the mean relative (absolute) error by about one order of magnitude. Moreover, it increases the query throughput by around three orders of magnitude, making it suitable for supporting online queries in real time.

引用

页码：2073 / 2086

页数：14

共 39 条

[1]

[Anonymous], 2011, P 17 GIITG C COMM DI

[2]

[Anonymous], The caida ucsd anonymized internet traces 2015

[3]

[Anonymous], CISCO SAMPLED NETFLO

[4]

Braverman V., 2018, LIPIcs, P1

[5] An improved data stream summary: the count-min sketch and its applications [J].

Cormode, G ;

Muthukrishnan, S .

JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2005, 55 (01) :58-75

[6]

Dimitropoulos X, 2008, ACM SIGCOMM COMP COM, V38, P7, DOI 10.1145/1341431.1341433

[7] Learn more, sample less: Control of volume and variance in network measurement [J].

Duffield, N ;

Lund, C ;

Thorup, M .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (05) :1756-1775

[8]

Duffield N., 2004, Performance Evaluation Review, V32, P85, DOI 10.1145/1012888.1005699

[9] New directions in traffic measurement and accounting: Focusing on the elephants, ignoring the mice [J].

Estan, C ;

Varghese, G .

ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2003, 21 (03) :270-313

[10] Bitmap algorithms for counting active flows on high-speed links [J].

Estan, Cristian ;

Varghese, George ;

Fisk, Michael .

IEEE-ACM TRANSACTIONS ON NETWORKING, 2006, 14 (05) :925-937

← 1 2 3 4 →