Towards Efficient Microarchitectural Design for Accelerating Unsupervised GAN-based Deep Learning

被引:40
作者
Song, Mingcong [1 ]
Zhang, Jiaqi [1 ]
Chen, Huixiang [1 ]
Li, Tao [1 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
来源
2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA) | 2018年
基金
美国国家科学基金会;
关键词
D O I
10.1109/HPCA.2018.00016
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, deep learning based approaches have emerged as indispensable tools to perform big data analytics. Normally, deep learning models are first trained with a supervised method and then deployed to execute various tasks. The supervised method involves extensive human efforts to collect and label the large-scale dataset, which becomes impractical in the big data era where raw data is largely unlabeled and uncategorized. Fortunately, the adversarial learning, represented by Generative Adversarial Network (GAN), enjoys a great success on the unsupervised learning. However, the distinct features of GAN, such as massive computing phases and non-traditional convolutions challenge the existing deep learning accelerator designs. In this work, we propose the first holistic solution for accelerating the unsupervised GAN-based Deep Learning. We overcome the above challenges with an algorithm and architecture co-design approach. First, we optimize the training procedure to reduce on-chip memory consumption. We then propose a novel time multiplexed design to efficiently map the abundant computing phases to our microarchitecture. Moreover, we design high efficiency dataflows to achieve high data reuse and skip the zero operand multiplications in the non-traditional convolutions. Compared with traditional deep learning accelerators, our proposed design achieves the best performance (average 4.3X) with the same computing resource. Our design also has an average of 8.3X speedup over CPU and 6.2X energy-efficiency over NVIDIA GPU.
引用
收藏
页码:66 / 77
页数:12
相关论文
共 31 条
[1]  
[Anonymous], IEEE 12 INT C COMP V
[2]  
[Anonymous], 2016, EUR C COMP VIS ECCV
[3]  
[Anonymous], C NEUR INF PROC SYST
[4]  
[Anonymous], ACM IEEE 43 ANN INT
[5]  
[Anonymous], 19 INT C ARCH SUPP P
[6]  
[Anonymous], 2016, P 2016 INT C PAR ARC
[7]  
[Anonymous], 2016, INT C LEARN REPR ICL
[8]  
[Anonymous], ACM IEEE 43 ANN INT
[9]  
[Anonymous], 2017, ARXIV170107875
[10]  
[Anonymous], 2011, IEEE COMP SOC C COMP