Universal Online Sketch for Tracking Heavy Hitters and Estimating Moments of Data Streams

被引:0
|
作者
Xiao, Qingjun [1 ]
Tang, Zhiying [2 ]
Chen, Shigang [3 ]
机构
[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing, Peoples R China
[2] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[3] Florida Univ, Dept CISE, Gainesville, FL USA
来源
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS | 2020年
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
COUNTER ARCHITECTURE; FREQUENT;
D O I
10.1109/infocom41043.2020.9155454
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic measurement is key to many network management tasks such as performance monitoring and cyber-security. Its aim is to inspect the packet stream passing through a network device, classify them into flows according to the header fields, and obtain statistics about the flows. For processing big streaming data in size-limited SRAM of line cards, many space-sublinear algorithms have been proposed, such as CountMin and CountSketch. However, most of them are designed for specific measurement tasks. Implementing multiple independent sketches places burden for online operations of a network device. It is highly desired to design a universal sketch that not only tracks individual large flows (called heavy hitters) but also reports overall traffic distribution statistics (called moments). The prior work UnivMon successfully tackled this ambitious quest. However, it incurs large and variable per-packet processing overhead, which may result in a significant throughput bottleneck in high-rate packet streaming, given that each packet requires 65 hashes and 64 memory accesses on average and many times of that in the worst case. To address this performance issue, we need to fundamentally redesign the solution architecture from hierarchical sampling to new progressive sampling and from CountSketch to new ActiveCM+, which ensure that per-packet overhead is a small constant (4 hash and 4 memory accesses) in the worst case, making it much more suitable for online operations, especially for pipeline implementation. The new design also makes effort to reduce memory footprint or equivalently improve measurement accuracy under the same memory. Our experiments show that our solution incurs just one sixteenth per-packet overhead of UnivMon, while improving measurement accuracy by three times under the same memory.
引用
收藏
页码:974 / 983
页数:10
相关论文
共 4 条
  • [1] Universal and Accurate Sketch for Estimating Heavy Hitters and Moments in Data Streams
    Xiao, Qingjun
    Cai, Xuyuan
    Qin, Yifei
    Tang, Zhiying
    Chen, Shigang
    Liu, Yu
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (05) : 1919 - 1934
  • [2] Finding Correlated Heavy-Hitters over Data Streams
    Lahiri, Bibudh
    Tirthapura, Srikanta
    2009 IEEE 28TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCC 2009), 2009, : 307 - 314
  • [3] Conditional heavy hitters: detecting interesting correlations in data streams
    Mirylenka, Katsiaryna
    Cormode, Graham
    Palpanas, Themis
    Srivastava, Divesh
    VLDB JOURNAL, 2015, 24 (03) : 395 - 414