Generalization bounds for sparse random feature expansions ?

被引:28
作者
Hashemi, Abolfazl [1 ]
Schaeffer, Hayden [2 ]
Shi, Robert [3 ]
Topcu, Ufuk [3 ]
Tran, Giang [4 ]
Ward, Rachel [3 ]
机构
[1] Purdue Univ, W Lafayette, IN USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Univ Texas Austin, Austin, TX USA
[4] Univ Waterloo, Waterloo, ON, Canada
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Random features; Sparse optimization; Generalization error; Compressive sensing; SIGNAL RECOVERY; APPROXIMATION; PROJECTION; SUBSPACE;
D O I
10.1016/j.acha.2022.08.003
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Random feature methods have been successful in various machine learning tasks, are easy to compute, and come with theoretical accuracy bounds. They serve as an alternative approach to standard neural networks since they can represent similar function spaces without a costly training phase. However, for accuracy, random feature methods require more measurements than trainable parameters, limiting their use for data-scarce applications. We introduce the sparse random feature expansion to obtain parsimonious random feature models. We leverage ideas from compressive sensing to generate random feature expansions with theoretical guarantees even in the data-scarce setting. We provide generalization bounds for functions in a certain class depending on the number of samples and the distribution of features. By introducing sparse features, i.e. features with random sparse weights, we provide improved bounds for low order functions. We show that our method outperforms shallow networks in several scientific machine learning tasks. (c) 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:310 / 330
页数:21
相关论文
共 60 条
[41]  
Potts D, 2021, Arxiv, DOI arXiv:1907.11412
[42]  
Rahimi A., 2007, NIPS, V20, P1177
[43]  
Rahimi A., 2009, Advances in Neural Information Processing Systems, P1313
[44]   Uniform Approximation of Functions with Random Bases [J].
Rahimi, Ali ;
Recht, Benjamin .
2008 46TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1-3, 2008, :555-+
[45]   Learning with infinitely many features [J].
Rakotomamonjy, A. ;
Flamary, R. ;
Yger, F. .
MACHINE LEARNING, 2013, 91 (01) :43-66
[46]   Interpolation via weighted l1 minimization [J].
Rauhut, Holger ;
Ward, Rachel .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2016, 40 (02) :321-351
[47]   Sparse Legendre expansions via l1-minimization [J].
Rauhut, Holger ;
Ward, Rachel .
JOURNAL OF APPROXIMATION THEORY, 2012, 164 (05) :517-533
[48]   Sparse additive models [J].
Ravikumar, Pradeep ;
Lafferty, John ;
Liu, Han ;
Wasserman, Larry .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 :1009-1030
[49]   l1 regularization in infinite dimensional feature spaces [J].
Rosset, Saharon ;
Swirszcz, Grzegorz ;
Srebro, Nathan ;
Zhu, Ji .
LEARNING THEORY, PROCEEDINGS, 2007, 4539 :544-+
[50]  
Rudi A, 2017, ADV NEUR IN, V30