Tensor-Factorized Neural Networks

被引:65
作者
Chien, Jen-Tzung [1 ]
Bao, Yi-Ting [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
关键词
Neural network (NN); pattern classification; tensor factorization (TF); and tensor-factorized error backpropagation; DECOMPOSITIONS;
D O I
10.1109/TNNLS.2017.2690379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing interests in multiway data analysis and deep learning have drawn tensor factorization (TF) and neural network (NN) as the crucial topics. Conventionally, the NN model is estimated from a set of one-way observations. Such a vectorized NN is not generalized for learning the representation from multiway observations. The classification performance using vectorized NN is constrained, because the temporal or spatial information in neighboring ways is disregarded. More parameters are required to learn the complicated data structure. This paper presents a new tensor-factorized NN (TFNN), which tightly integrates TF and NN for multiway feature extraction and classification under a unified discriminative objective. This TFNN is seen as a generalized NN, where the affine transformation in an NN is replaced by the multilinear and multiway factorization for tensor-based NN. The multiway information is preserved through layerwise factorization. Tucker decomposition and nonlinear activation are performed in each hidden layer. The tensor-factorized error backpropagation is developed to train TFNN with the limited parameter size and computation time. This TFNN can be further extended to realize the convolutional TFNN (CTFNN) by looking at small subtensors through the factorized convolution. Experiments on real-world classification tasks demonstrate that TFNN and CTFNN attain substantial improvement when compared with an NN and a convolutional NN, respectively.
引用
收藏
页码:1998 / 2011
页数:14
相关论文
共 39 条
[1]   Convolutional Neural Networks for Speech Recognition [J].
Abdel-Hamid, Ossama ;
Mohamed, Abdel-Rahman ;
Jiang, Hui ;
Deng, Li ;
Penn, Gerald ;
Yu, Dong .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) :1533-1545
[2]  
[Anonymous], 2013, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2012.59
[3]  
[Anonymous], 2013, NIPS
[4]  
[Anonymous], 2015, PROC IEEE INT WORKSH
[5]  
[Anonymous], 2011, NEURAL INFORM PROCES
[6]  
[Anonymous], 2009, NONNEGATIVE MATRIX T
[7]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[8]  
[Anonymous], 2012, ABS12070580 CORR
[9]  
[Anonymous], 2013, P 26 INT C NEUR INF
[10]  
Barker T, 2013, INTERSPEECH, P827