Network-Aware Grouping in Distributed Stream Processing Systems

被引:2
|
作者
Chen, Fei [1 ]
Wu, Song [1 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Wuhan 430074, Peoples R China
关键词
Stream processing; Load balancing; Grouping; Network distance;
D O I
10.1007/978-3-030-05051-1_1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed Stream Processing (DSP) systems have recently attracted much attention because of their ability to process huge volumes of real-time stream data with very low latency on clusters of commodity hardware. Existing workload grouping strategies in a DSP system can be classified into four categories (i.e. raw and blind, data skewness, cluster heterogeneity, and dynamic load-aware). However, these traditional stream grouping strategies do not consider network distance between two communicating operators. In fact, the traffic from different network channels makes a significant impact on performance. How to grouping tuples according to network distances to improve performance has been a critical problem. In this paper, we propose a network-aware grouping framework called Squirrel to improve the performance under different network distances. Identifying the network location of two communicating operators, Squirrel sets a weight and priority for each network channel. It introduces Weight Grouping to assign different numbers of tuples to each network channel according to channel's weight and priority. In order to adapt to changes in network conditions, input load, resources and other factors, Squirrel uses Dynamic Weight Control to adjust network channel's weight and priority online by analyzing runtime information. Experimental results prove Squirrel's effectiveness and show that Squirrel can achieve 1.67x improvement in terms of throughput and reduce the latency by 47%.
引用
收藏
页码:3 / 18
页数:16
相关论文
共 50 条
  • [1] SNR: Network-aware Geo-Distributed Stream Analytics
    Mostafaei, Habib
    Afridi, Shafi
    Abawajy, Jemal H.
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 820 - 827
  • [2] SquirrelJoin: Network-Aware Distributed Join Processing with Lazy Partitioning
    Rupprecht, Lukas
    Culhane, William
    Pietzuch, Peter
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (11): : 1250 - 1261
  • [3] Nap: Network-Aware Data Partitions for Efficient Distributed Processing
    Raz, Or
    Avin, Chen
    Schmid, Stefan
    2019 IEEE 18TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2019, : 69 - 77
  • [4] Network-Aware Stream Query Processing in Mobile Ad-Hoc Networks
    O'Keeffe, Dan
    Salonidis, Theodoros
    Pietzuch, Peter
    2015 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2015), 2015, : 1335 - 1340
  • [5] A distributed network-aware TSCH scheduling
    Vieira Junior, Ivanilson Franca
    Granjal, Jorge
    Curado, Marilia
    2023 19TH INTERNATIONAL CONFERENCE ON THE DESIGN OF RELIABLE COMMUNICATION NETWORKS, DRCN, 2023,
  • [6] Online Scheduling for Shuffle Grouping in Distributed Stream Processing Systems
    Rivetti, Nicolo
    Anceaume, Emmanuelle
    Busnel, Yann
    Querzoni, Leonardo
    Sericola, Bruno
    MIDDLEWARE '16: PROCEEDINGS OF THE 17TH INTERNATIONAL MIDDLEWARE CONFERENCE, 2016,
  • [7] Network-aware distributed computing: A case study
    Tangmunarunkit, H
    Steenkiste, P
    PARALLEL AND DISTRIBUTED PROCESSING, 1998, 1388 : 171 - 182
  • [8] Network-aware support for mobile distributed teams
    van der Kleij, Rick
    de Jong, Alexis
    te Brake, Guido
    de Greef, Tjerk
    COMPUTERS IN HUMAN BEHAVIOR, 2009, 25 (04) : 940 - 948
  • [9] A Network-aware and Partition-based Resource Management Scheme for Data Stream Processing
    Wang, Yidan
    Tari, Zahir
    Huang, Xiaoran
    Zomaya, Albert Y.
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [10] Network-Aware Distributed Machine Learning OverWide Area Network
    Zhou, Pan
    Sun, Gang
    Yu, Hongfang
    Chang, Victor
    MODERN INDUSTRIAL IOT, BIG DATA AND SUPPLY CHAIN, IIOTBDSC 2020, 2021, 218 : 55 - 62