Recent work on deep belief networks (DBNs) has shown that large-scale unsupervised feature learning models can dramatically improve performance in many application domains. Training the billions of parameters in such models, for example restricted Boltzmann machines (RBMs), is computationally challenging for modern CPUs. Graphics Processing Units (GPUs) have been employed in many large-scale deep learning models to accelerate training, owing to their massively parallel computing capability. Unfortunately, the limited device memory of a GPU restricts the size of the model that can be trained on a single card, while multi-GPU approaches suffer from inefficient inter-device communication and higher hardware cost. In this paper, we propose a novel memory-efficient algorithm that trains large-scale RBMs on a single GPU without this size restriction while preserving the performance gain of GPU parallel computation. In particular, our experiments demonstrate that the approach uses 75% less memory at the cost of only a 10% performance loss when training large-scale RBMs with billions of parameters.