A gradient approximation algorithm based weight momentum for restricted Boltzmann machine

被引:10
作者
Shen, Huihui [1 ,2 ,3 ,4 ]
Li, Hongwei [1 ,3 ]
机构
[1] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Hubei, Peoples R China
[2] Hubei Univ Econ, Sch Stat & Informat Management, Wuhan 430205, Hubei, Peoples R China
[3] China Univ Geosci, Hubei Subsurface Multiscale Imaging Key Lab, Wuhan 430074, Hubei, Peoples R China
[4] Hubei Financial Dev & Financial Secur Res Ctr, Wuhan 430205, Hubei, Peoples R China
关键词
Deep learning; Restricted Boltzmann machine; Gradient method; Average contrastive divergence; Weight-decay momentum; PRODUCTS; EXPERTS; DEEP;
D O I
10.1016/j.neucom.2019.07.074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Restricted Boltzmann Machine (RBM) is a powerful generative model in deep learning that can automatically conduct learning of data probability distributions without supervision. Deep architectures can effectively enhance the capability of image feature expression in image recognition. However, their learning time is long or they present poor performance given the same running time in a deep model. To address these problems, we propose a new gradient approximation algorithm called average contrastive divergence (ACD) with weight-decay momentum for training the RBM. It is an improved contrastive divergence (CD) algorithm combined with weight-decay momentum. Different combinations of the weight momentum term are added in the pre-training and fine-tuning phases of the RBM to accelerate the network convergence and improve the classification effect. Finally, the proposed algorithm is evaluated on the MNIST database, Extended Yale B and CMU-PIE face databases. The experimental results show that the proposed learning algorithm is a better approximation of the log-likelihood gradient method and outperforms the other algorithms. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 49
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 2010, JMLR WORKSHOP C P AI
[2]  
[Anonymous], 2013, INT C MACHINE LEARNI
[3]  
[Anonymous], P WORLD C COMP INT W
[4]  
[Anonymous], 2010, P 13 INT C ARTIFICIA
[5]  
[Anonymous], 2019, IEEE T AUTOMATIC CON
[6]  
[Anonymous], 2010000 U TOR MACH L
[7]  
[Anonymous], 1964, COMP MATH MATH PHYS+
[8]   Justifying and Generalizing Contrastive Divergence [J].
Bengio, Yoshua ;
Delalleau, Olivier .
NEURAL COMPUTATION, 2009, 21 (06) :1601-1621
[9]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[10]  
Cho K., 2011, Proceedings of the 28th International Conference on Machine Learning, ICML, P105