Network-Aware Grouping in Distributed Stream Processing Systems

被引:2
|
作者
Chen, Fei [1 ]
Wu, Song [1 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Wuhan 430074, Peoples R China
关键词
Stream processing; Load balancing; Grouping; Network distance;
D O I
10.1007/978-3-030-05051-1_1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed Stream Processing (DSP) systems have recently attracted much attention because of their ability to process huge volumes of real-time stream data with very low latency on clusters of commodity hardware. Existing workload grouping strategies in a DSP system can be classified into four categories (i.e. raw and blind, data skewness, cluster heterogeneity, and dynamic load-aware). However, these traditional stream grouping strategies do not consider network distance between two communicating operators. In fact, the traffic from different network channels makes a significant impact on performance. How to grouping tuples according to network distances to improve performance has been a critical problem. In this paper, we propose a network-aware grouping framework called Squirrel to improve the performance under different network distances. Identifying the network location of two communicating operators, Squirrel sets a weight and priority for each network channel. It introduces Weight Grouping to assign different numbers of tuples to each network channel according to channel's weight and priority. In order to adapt to changes in network conditions, input load, resources and other factors, Squirrel uses Dynamic Weight Control to adjust network channel's weight and priority online by analyzing runtime information. Experimental results prove Squirrel's effectiveness and show that Squirrel can achieve 1.67x improvement in terms of throughput and reduce the latency by 47%.
引用
收藏
页码:3 / 18
页数:16
相关论文
共 50 条
  • [21] Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers
    Cheng, Long
    Wang, Ying
    Liu, Qingzhi
    Epema, Dick H. J.
    Liu, Cheng
    Mao, Ying
    Murphy, John
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (06) : 1494 - 1510
  • [22] An Optimal Network-Aware Scheduling Technique for Distributed Deep Learning in Distributed HPC Platforms
    Lee, Sangkwon
    Shah, Syed Asif Raza
    Seok, Woojin
    Moon, Jeonghoon
    Kim, Kihyeon
    Shah, Syed Hasnain Raza
    ELECTRONICS, 2023, 12 (14)
  • [23] Network-Aware Server Placement for Highly Interactive Distributed Virtual Environments
    Ta, Duong
    Zhou, Suiping
    Cai, Wentono
    Tang, Xueyan
    Ayani, Rassul
    DS-RT 2008: 12TH 2008 IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS, PROCEEDINGS, 2008, : 95 - +
  • [24] A Recursive Distributed Topology Discovery Service for Network-Aware Grid Clients
    Paolucci, Francesco
    Valcarenghi, Luca
    Castoldi, Piero
    Cugini, Filippo
    2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 1313 - +
  • [25] A Cluster-Based Data-Centric Model for Network-Aware Task Scheduling in Distributed Systems
    Ugo Fiore
    Francesco Palmieri
    Aniello Castiglione
    Alfredo De Santis
    International Journal of Parallel Programming, 2014, 42 : 755 - 775
  • [26] A Cluster-Based Data-Centric Model for Network-Aware Task Scheduling in Distributed Systems
    Fiore, Ugo
    Palmieri, Francesco
    Castiglione, Aniello
    De Santis, Alfredo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (05) : 755 - 775
  • [27] Enabling network-aware applications
    Tierney, BL
    Gunter, D
    Lee, J
    Stoufer, M
    Evans, JB
    10TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2001, : 281 - 288
  • [28] Challenging Data Management in CMS Computing with Network-Aware Systems
    Bonacorsi, Daniele
    Wildish, Tony
    2013 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2013,
  • [29] Network-aware join processing in global-scale database federations
    Wang, Xiaodan
    Burns, Randal
    Terzis, Andreas
    Deshpande, Amol
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 586 - +
  • [30] Planning for network-aware paths
    Fu, XD
    Karamcheti, V
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, PROCEEDINGS, 2003, 2893 : 187 - 199