Optimal and Efficient Distributed Online Learning for Big Data

被引:3
作者
Sayin, Muhammed O. [1 ]
Vanli, N. Denizcan [1 ]
Delibalta, Ibrahim [2 ,3 ]
Kozat, Suleyman S. [1 ]
机构
[1] Bilkent Univ, Dept Elect & Elect Engn, Ankara, Turkey
[2] AVEA Commun Serv Inc, AveaLabs, Istanbul, Turkey
[3] Koc Univ, Grad Sch Social Sci & Humanities, Istanbul, Turkey
来源
2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015 | 2015年
关键词
distributed processing; online learning; optimal and efficient; static state estimation; Big Data; smart grid; DIFFUSION STRATEGIES; STATE ESTIMATION; CONSENSUS; NETWORKS; SCHEME;
D O I
10.1109/BigDataCongress.2015.27
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose optimal and efficient distributed online learning strategies for Big Data applications. Here, we consider the optimal state estimation over distributed network of autonomous data sources. The autonomous data sources can generate and process data locally irrespective of any centralized control unit. We seek to enhance the learning rate through the distributed control of those autonomous data sources. We emphasize that although this problem attracted significant attention and extensively studied in different fields including services computing and machine learning disciplines, all the well-known strategies achieve suboptimal online learning performance in the mean square error sense. To this end, we introduce the oracle algorithm as the optimal distributed online learning strategy. We also propose the optimal and efficient distributed online learning algorithm that reduces the communication load tremendously, i.e., requires the undirected disclosure of only a single scalar. Finally, we demonstrate the significant performance gains due to the proposed strategies with respect to the state-of-the-art approaches.
引用
收藏
页码:126 / 133
页数:8
相关论文
共 33 条
  • [1] Achbany Youssef, 2008, IEEE Transactions on Services Computing, V1, P141, DOI 10.1109/TSC.2008.12
  • [2] ad M-H Wong S.-P.L., 2013, Proceedings of IEEE International Congress on Big Data, P17
  • [3] Anderson B.D.O., 1979, Optimal Filtering
  • [4] [Anonymous], 2007, MATH STAT DATA ANAL
  • [5] [Anonymous], 2013, Advances in Neural Information Processing Systems
  • [6] Reducing Data Transfer in Service-Oriented Architectures: The Circulate Approach
    Barker, Adam
    Weissman, Jon B.
    van Hemert, Jano I.
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2012, 5 (03) : 437 - 449
  • [7] Blondel VD, 2005, IEEE DECIS CONTR P, P2996
  • [8] Diffusion Strategies for Distributed Kalman Filtering and Smoothing
    Cattivelli, Federico S.
    Sayed, Ali H.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (09) : 2069 - 2084
  • [9] Diffusion LMS Strategies for Distributed Estimation
    Cattivelli, Federico S.
    Sayed, Ali H.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (03) : 1035 - 1048
  • [10] Chung F., 1992, Spectral Graph Theory