A Neural Network Model for Cache and Memory Prediction of Neural Networks

Citations: 0
Authors
Sha, Sai [1, 3, 4]
Luo, Yingwei [1, 3, 4]
Wang, Zhenlin [2]
Wang, Xiaolin [1, 3, 4]
Affiliations
[1] Peking Univ, Beijing, Peoples R China
[2] Michigan Technol Univ, Houghton, MI 49931 USA
[3] Peking Univ Shenzhen, Shenzhen Key Lab Cloud Comp Technol & Applicat, SECE, Shenzhen, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
Source
2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS | 2018
Funding
National Science Foundation (United States);
Keywords
Neural network model; Cache; Memory;
DOI
10.1109/BDCloud.2018.00142
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Neural networks have been widely applied in research and production. However, most recent research focuses on establishing and selecting a specific neural network model. Less attention is paid to system overhead, despite the massive computing and storage resources these models demand. This research pursues a relatively new direction: modeling the system-level memory and cache demand of neural networks. We use a neural network to learn and predict the hit ratio curve and memory footprint of neural networks, taking their hyper-parameters as input. The predictions then drive cache partitioning and memory partitioning to optimize the co-execution of multiple neural networks. To demonstrate the effectiveness of our approach, we model four common networks: the back-propagation (BP) neural network, the convolutional neural network, the recurrent neural network, and the autoencoder. We investigate how the hyper-parameters of each model influence last-level cache and memory demand. We use the BP algorithm as the learning tool to predict the last-level cache hit ratio curve and memory usage. Our experimental results show that cache and memory allocation schemes guided by our predictions can optimize for a wide range of performance targets.
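The abstract does not say which partitioning algorithm consumes the predicted curves. As an illustration only, the sketch below shows one common way a hit ratio curve can guide cache partitioning: a greedy allocator that hands out cache ways one at a time to the workload with the largest marginal hit-ratio gain. The function name `partition_cache`, the workload names, and the curve values are hypothetical and not taken from the paper; the greedy choice maximizes the summed hit ratio only when the curves are concave.

```python
from typing import Dict, List


def partition_cache(curves: Dict[str, List[float]], total_ways: int) -> Dict[str, int]:
    """Greedily assign cache ways by largest marginal hit-ratio gain.

    curves[name][w] is the (predicted) hit ratio of workload `name`
    when it owns w cache ways; index 0 is the hit ratio with zero ways.
    """
    alloc = {name: 0 for name in curves}
    for _ in range(total_ways):
        best_name, best_gain = None, float("-inf")
        for name, curve in curves.items():
            w = alloc[name]
            if w + 1 >= len(curve):
                continue  # curve has no data point beyond w ways
            gain = curve[w + 1] - curve[w]
            if gain > best_gain:
                best_name, best_gain = name, gain
        if best_name is None:
            break  # every workload is already at the end of its curve
        alloc[best_name] += 1
    return alloc


# Hypothetical predicted curves (made-up numbers, for illustration only).
curves = {
    "cnn": [0.0, 0.30, 0.50, 0.62, 0.66],
    "rnn": [0.0, 0.10, 0.15, 0.18, 0.20],
}
allocation = partition_cache(curves, total_ways=4)
print(allocation)
```

With these made-up curves, the workload whose hit ratio rises fastest receives most of the ways. A different performance target (for example, fairness rather than total hits) would need a different objective in place of the marginal-gain comparison.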
Pages: 972-978
Page count: 7