Optimized distributed large-scale analytics over decentralized data sources with imperfect communication

被引:1
|
作者
Shahbazian, Reza [1 ]
Guerriero, Francesca [2 ]
机构
[1] Univ Calabria UniCal, Dept Math & Comp Sci, I-87036 Arcavacata Di Rende, CS, Italy
[2] Univ Calabria UniCal, Dept Mech Energy & Management Engn, I-87036 Arcavacata Di Rende, CS, Italy
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 11期
关键词
Big data; Large scale; Optimization; Distributed; Imperfect communication; BIG DATA; DIFFUSION LMS; ALGORITHM; NETWORKS; CONVEX;
D O I
10.1007/s11227-019-03129-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Tremendous amounts of data are generated by sensors and connected devices with high velocity in a variety of forms and large volumes. These characteristics, defined as big data, need new models and methods to be processed in near real-time. The nature of decentralized large-scale data sources requires distributed algorithms in which it is assumed that the data sources are capable of processing their own data and collaborating with neighbor sources. The network objective is to make an optimal decision, while the data are processed in a distributed manner. New technologies, like next generation of wireless communication and 5G, introduce practical issues such as imperfect communication that should be addressed. In this paper, we study a generalized form of distributed algorithms for decision-making over decentralized data sources. We propose an optimal algorithm that uses optimal weighting to combine the resource of neighbors. We define an optimization problem and find the solution by applying the proposed algorithm. We evaluate the performance of the developed algorithm by using both mathematical methods and computer simulations. We introduce the conditions in which the convergence of proposed algorithm is guaranteed and prove that the network error decreases considerably in comparison with some of the known modern methods.
引用
收藏
页码:8510 / 8528
页数:19
相关论文
共 50 条
  • [1] Optimized distributed large-scale analytics over decentralized data sources with imperfect communication
    Reza Shahbazian
    Francesca Guerriero
    The Journal of Supercomputing, 2020, 76 : 8510 - 8528
  • [2] Distributed optimization over large-scale systems for big data analytics
    Reza Shahbazian
    4OR, 2021, 19 : 309 - 310
  • [3] Distributed optimization over large-scale systems for big data analytics
    Shahbazian, Reza
    4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2021, 19 (02): : 309 - 310
  • [4] Riffle: Optimized Shuffle Service for Large-Scale Data Analytics
    Zhang, Haoyu
    Cho, Brian
    Seyfe, Ergin
    Ching, Avery
    Freedman, Michael J.
    EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
  • [5] On-demand Self-adaptive Data Analytics in Large-scale Decentralized Networks
    Pournaras, Evangelos
    Nikolic, Jovan
    2017 IEEE 16TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2017, : 185 - 194
  • [6] Improving Efficiency in Large-Scale Decentralized Distributed Training
    Mang, Wei
    Cui, Xiaodong
    Kayi, Abdullah
    Liu, Mingrui
    Finkler, Ulrich
    Kingsbury, Brian
    Saon, George
    Mroueh, Youssef
    Buyuktosunoglu, Alper
    Das, Payel
    Kung, David
    Picheny, Michael
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3022 - 3026
  • [7] A Hybrid Data Model for Large-Scale Analytics
    Feo, John
    2018 ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2018, : 269 - 269
  • [8] Primal-Dual Methods for Large-Scale and Distributed Convex Optimization and Data Analytics
    Jakovetic, Dusan
    Bajovic, Dragana
    Xavier, Joao
    Moura, Jose M. F.
    PROCEEDINGS OF THE IEEE, 2020, 108 (11) : 1923 - 1938
  • [9] Adaptive Partitioning for Large-Scale Graph Analytics in Geo-Distributed Data Centers
    Zhou, Amelie Chi
    Luo, Juanyun
    Qiu, Ruibo
    Tan, Haobin
    He, Bingsheng
    Mao, Rui
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2818 - 2830
  • [10] Group communication for large-scale distributed systems over IP multicast networks
    Mathur, AG
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 710 - 717