Alternating Optimization Method Based on Nonnegative Matrix Factorizations for Deep Neural Networks

被引:0
作者
Sakurai, Tetsuya [1 ,2 ]
Imakura, Akira [1 ]
Inoue, Yuto [1 ]
Futamura, Yasunori [1 ]
机构
[1] Univ Tsukuba, Dept Comp Sci, Tsukuba, Ibaraki 3058573, Japan
[2] Japan Sci & Technol Agcy, CREST, Kawaguchi 3320012, Japan
来源
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV | 2016年 / 9950卷
关键词
Nonnegative matrix factorizations; Alternating optimization method; Deep neural networks;
D O I
10.1007/978-3-319-46681-1_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The backpropagation algorithm for calculating gradients has been widely used in computation of weights for deep neural networks (DNNs). This method requires derivatives of objective functions and has some difficulties finding appropriate parameters such as learning rate. In this paper, we propose a novel approach for computing weight matrices of fully-connected DNNs by using two types of semi-nonnegative matrix factorizations (semi-NMFs). In this method, optimization processes are performed by calculating weight matrices alternately, and backpropagation (BP) is not used. We also present a method to calculate stacked autoencoder using a NMF. The output results of the autoencoder are used as pre-training data for DNNs. The experimental results show that our method using three types of NMFs attains similar error rates to the conventional DNNs with BP.
引用
收藏
页码:354 / 362
页数:9
相关论文
共 16 条
  • [1] [Anonymous], 2011, 22 INT JT C ART INT, DOI 10.5555/2283516.2283603
  • [2] Bengio Y., ADV NEURAL INFORM PR, P153
  • [3] Learning Understandable Neural Networks With Nonnegative Weight Constraints
    Chorowski, Jan
    Zurada, Jacek M.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) : 62 - 69
  • [4] Convex and Semi-Nonnegative Matrix Factorizations
    Ding, Chris
    Li, Tao
    Jordan, Michael I.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (01) : 45 - 55
  • [5] Glorot X., 2011, 14 INT C ART INT STA, P315, DOI DOI 10.1177/1753193410395357
  • [6] Glorot X, 2010, P 13 INT C ART INT S, P249
  • [7] Deep Neural Networks for Acoustic Modeling in Speech Recognition
    Hinton, Geoffrey
    Deng, Li
    Yu, Dong
    Dahl, George E.
    Mohamed, Abdel-rahman
    Jaitly, Navdeep
    Senior, Andrew
    Vanhoucke, Vincent
    Patrick Nguyen
    Sainath, Tara N.
    Kingsbury, Brian
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 82 - 97
  • [8] Kingma D.P., 2015, INT C LEARN REPR ICL, DOI DOI 10.1002/9781118900772.ETRDS0277
  • [9] Krizhevsky A., 2009, LEARNING MULTIPLE LA
  • [10] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324