Creating Robust Deep Neural Networks With Coded Distributed Computing for IoT

被引:1
|
作者
Hadidi, Ramyad [1 ]
Cao, Jiashen [2 ]
Asgari, Bahar [2 ,3 ]
Kim, Hyesoon [2 ]
机构
[1] Rain AI, Atlanta, GA 30332 USA
[2] Georgia Tech, Atlanta, GA USA
[3] Univ Maryland, College Pk, MD USA
来源
2023 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND COMMUNICATIONS, EDGE | 2023年
关键词
Edge AI; Reliability; IoT; Edge; Distributed Computing; Collaborative Edge & Robotics;
D O I
10.1109/EDGE60047.2023.00029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing interest in serverless computation and ubiquitous wireless networks has led to numerous connected devices in our surroundings. Such IoT devices have access to an abundance of raw data, but their inadequate resources in computing limit their capabilities. With the emergence of deep neural networks (DNNs), the demand for the computing power of IoT devices is increasing. To overcome inadequate resources, several studies have proposed distribution methods for IoT devices that harvest the aggregated computing power of idle IoT devices in an environment. However, since such a distributed system strongly relies on each device, unstable latency, and intermittent failures, the common characteristics of IoT devices and wireless networks, cause high recovery overheads. To reduce this overhead, we propose a novel robustness method with a close-to-zero recovery latency for DNN computations. Our solution never loses a request or spends time recovering from a failure. To do so, first, we analyze how matrix computations in DNNs are affected by distribution. Then, we introduce a novel coded distributed computing (CDC) method, the cost of which, unlike that of modular redundancies, is constant when the number of devices increases. Our method is applied at the library level, without requiring extensive changes to the program, while still ensuring a balanced work assignment during distribution.
引用
收藏
页码:126 / 132
页数:7
相关论文
共 50 条
  • [21] Incentive-Based Coded Distributed Computing Management for Latency Reduction in IoT Services-A Game Theoretic Approach
    Kim, Nakyoung
    Kim, Daejin
    Lee, Joohyung
    Niyato, Dusit
    Choi, Jun Kyun
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (10): : 8259 - 8278
  • [22] Secure Coded Computation for Efficient Distributed Learning in Mobile IoT
    Yang, Yilin
    D'Oliveira, Rafael G. L.
    El Rouayheb, Salim
    Yang, Xin
    Seferoglu, Hulya
    Chen, Yingying
    2021 18TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2021,
  • [23] Electrocardiogram signal classification in an IoT environment using an adaptive deep neural networks
    Mary, G. Aloy Anuja
    Sathyasri, B.
    Murali, K.
    Prabhu, L. Arokia Jesu
    Devi, N. Bharatha
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (21) : 15333 - 15342
  • [24] Real-Time Distributed Computing at Network Edges for Large Scale Industrial IoT Networks
    Oyekanlu, Emmanuel
    Scoles, Kevin
    2018 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2018), 2018, : 63 - 64
  • [25] On Batch-Processing Based Coded Computing for Heterogeneous Distributed Computing Systems
    Wang, Baoqian
    Xie, Junfei
    Lu, Kejie
    Wan, Yan
    Fu, Shengli
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (03): : 2438 - 2454
  • [26] Electrocardiogram signal classification in an IoT environment using an adaptive deep neural networks
    G. Aloy Anuja Mary
    B. Sathyasri
    K. Murali
    L. Arokia Jesu Prabhu
    N. Bharatha Devi
    Neural Computing and Applications, 2023, 35 : 15333 - 15342
  • [27] Towards Robust Synchronization in IoT Networks
    Gore, Rahul N.
    Elizabeth, Namita
    Dzung, Dacfey
    Ashok, S.
    2019 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2019, : 678 - 683
  • [28] DTS: A Simulator to Estimate the Training Time of Distributed Deep Neural Networks
    Robinson, Wilfredo J. M.
    Esposito, Flavio
    Zuluaga, Maria A.
    2022 30TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, MASCOTS, 2022, : 17 - 24
  • [29] Robust Machine Learning Systems: Reliability and Security for Deep Neural Networks
    Hanif, Muhammad Abdullah
    Khalid, Faiq
    Putra, Rachmad Vidya Wicaksana
    Rehman, Semeen
    Shafique, Muhammad
    2018 IEEE 24TH INTERNATIONAL SYMPOSIUM ON ON-LINE TESTING AND ROBUST SYSTEM DESIGN (IOLTS 2018), 2018, : 257 - 260
  • [30] A robust and trusted framework for IoT networks
    Joshi G.
    Sharma V.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9001 - 9019