Deep Neural Networks with Random Gaussian Weights: A Universal Classification Strategy?

被引:90
作者
Giryes, Raja [1 ]
Sapiro, Guillermo [2 ]
Bronstein, Alex M. [1 ]
机构
[1] Tel Aviv Univ, Fac Engn, Sch Elect Engn, IL-69978 Ramat Aviv, Israel
[2] Duke Univ, Dept Elect & Comp Engn, Durham, NC 27708 USA
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
Artificial neural networks; computation theory; deep learning; learning systems; SIGNAL RECOVERY; SPARSE; PROJECTIONS;
D O I
10.1109/TSP.2016.2546221
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Three important properties of a classification machinery are i) the system preserves the core information of the input data; ii) the training examples convey information about unseen data; and iii) the system is able to treat differently points from different classes. In this paper, we show that these fundamental properties are satisfied by the architecture of deep neural networks. We formally prove that these networks with random Gaussian weights perform a distance-preserving embedding of the data, with a special treatment for in-class and out-of-class data. Similar points at the input of the network are likely to have a similar output. The theoretical analysis of deep networks here presented exploits tools used in the compressed sensing and dictionary learning literature, thereby making a formal connection between these important topics. The derived results allow drawing conclusions on the metric learning properties of the network and their relation to its structure, as well as providing bounds on the required size of the training set such that the training examples would represent faithfully the unseen data. The results are validated with state-of-the-art trained networks.
引用
收藏
页码:3444 / 3457
页数:14
相关论文
共 57 条
[1]   One-bit compressed sensing with non-Gaussian measurements [J].
Ai, Albert ;
Lapanowski, Alex ;
Plan, Yaniv ;
Vershynin, Roman .
LINEAR ALGEBRA AND ITS APPLICATIONS, 2014, 441 :222-239
[2]   Living on the edge: phase transitions in convex programs with random data [J].
Amelunxen, Dennis ;
Lotz, Martin ;
McCoy, Michael B. ;
Tropp, Joel A. .
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2014, 3 (03) :224-294
[3]  
[Anonymous], 1991, Probability in Banach Spaces
[4]  
[Anonymous], SCALE KERNEL METHODS
[5]  
[Anonymous], P ICLR WORKSH JAN
[6]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[7]  
[Anonymous], 1984, C MODERN ANAL PROBAB
[8]  
[Anonymous], THEORET COM IN PRESS
[9]  
[Anonymous], P S FDN COMP SCI
[10]  
[Anonymous], 2015, 3 INT C LEARN REPR I