Duality-Based Locality-Aware Stream Partitioning in Distributed Stream Processing Engines

被引:0
作者
Son, Siwoon [1 ]
Moon, Yang-Sae [1 ]
机构
[1] Kangwon Natl Univ, Chunchon, South Korea
来源
EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS | 2020年 / 11997卷
关键词
Distributed processing; Data stream; Locality; Duality;
D O I
10.1007/978-3-030-48340-1_57
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose duality-based locality-aware stream partitioning (LSP) in distributed stream processing engines (DSPEs). In general, LSP directly uses the locality concept of distributed batch processing engines (DBPEs). This concept does not fully take into account the characteristics of DSPEs and therefore does not maximize cluster resource utilization. To solve this problem, we first explain the limitations of existing LSP, and we then propose a duality relationship between DBPEs and DSPEs. We finally propose a simple but efficient ping-based mechanism to maximize the locality of DSPEs based on the duality. The insights uncovered in this paper can maximize the throughput and minimize the latency in stream partitioning.
引用
收藏
页码:725 / 730
页数:6
相关论文
共 28 条
  • [21] Decentralized Utility- and Locality-Aware Replication for Heterogeneous DHT-Based P2P Cloud Storage Systems
    Hassanzadeh-Nazarabadi, Yahya
    Kupcu, Alptekin
    Ozkasap, Oznur
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (05) : 1183 - 1193
  • [22] Storm-based distributed sampling system for multi-source stream environment
    Cho, Wonhyeong
    Gil, Myeong-Seon
    Choi, Mi-Jung
    Moon, Yang-Sae
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2018, 14 (11):
  • [23] A Load Distribution Method Based on Distributed Hashing for P2P Sensor Data Stream Delivery System
    Kawakami, Tomoya
    Ishi, Yoshimasa
    Yoshihisa, Tomoki
    Teranishi, Yuuichi
    2014 38TH ANNUAL IEEE INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW 2014), 2014, : 716 - 721
  • [24] A Density-Grid Based Clustering Algorithm on Data Stream Using Resilient Distributed Datasets
    Zhang, Yuan
    Zhang, Jiongmin
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 316 - 322
  • [25] A Distributed Rough Set Theory Algorithm based on Locality Sensitive Hashing for an Efficient Big Data Pre-processing
    Dagdia, Zaineb Chelly
    Zarges, Christine
    Beck, Gael
    Azzag, Hanene
    Lebbah, Mustapha
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2597 - 2606
  • [26] Feature-Based High Availability Mechanism for Extreme Aggregation Tasks in Real-Time Data Stream Processing
    Ding, Weilong
    Han, Yanbo
    Wang, Jing
    Zhao, Zhuofeng
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (02): : 327 - 339
  • [27] Feature-based high-availability mechanism for quantile tasks in real-time data stream processing
    Ding, Weilong
    Han, Yanbo
    Wang, Jing
    Zhao, Zhuofeng
    SOFTWARE-PRACTICE & EXPERIENCE, 2014, 44 (07) : 855 - 871
  • [28] Urban Hotspot Detection from the Data Stream of Taxi Pick-up and Drop-off based on Distributed Multistage Grid Clustering
    Wang H.
    Xiang L.
    Guan X.
    Zhang Y.
    Journal of Geo-Information Science, 2023, 25 (07) : 1514 - 1530