To cloud or not to cloud: an on-line scheduler for dynamic privacy-protection of deep learning workload on edge devices

Cited by: 0
Authors
Yibin Tang
Ying Wang
Huawei Li
Xiaowei Li
Affiliations
[1] State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
[3] Peng Cheng Laboratory
[4] Wuhan Digital Engineering Institute
Source
CCF Transactions on High Performance Computing | 2021 / Vol. 3
Keywords
Real-time; Deep learning; Edge computing; Privacy protection
DOI
Not available
Abstract
Deep learning applications have recently thrived in edge and mobile computing scenarios, driven by latency constraints, data security and privacy concerns, and other considerations. However, because of limits on power delivery, battery lifetime, and computation resources, real-time neural network inference on such devices has to rely on specialized energy-efficient architectures, and sometimes on coordination between the edge devices and powerful cloud or fog facilities. This work investigates a realistic scenario in which an on-line scheduler is needed to meet latency requirements even when edge computing resources and communication speed fluctuate dynamically, while also protecting user privacy. It also leverages the approximate computing property of neural networks and actively trades off excessive neural network propagation paths for a latency guarantee even when local resource provision is unstable. By combining neural network approximation with dynamic scheduling, the real-time deep learning system can adapt to different latency/accuracy requirements and to the resource fluctuation of mobile-cloud applications. Experimental results also demonstrate that the proposed scheduler significantly improves the energy efficiency of real-time neural networks on edge devices.
Pages: 85-100
Page count: 15