Optimal fast Johnson-Lindenstrauss embeddings for large data sets

被引:1
作者
Bamberger, Stefan [1 ]
Krahmer, Felix [1 ]
机构
[1] Techn Univ Munich, Dept Math, Boltzmannstr 3, D-85748 Garching, Germany
来源
SAMPLING THEORY SIGNAL PROCESSING AND DATA ANALYSIS | 2021年 / 19卷 / 01期
关键词
Johnson-Lindenstrauss embeddings; Hadamard transforms; Restricted isometry property; Fast matrix multiplication; RECTANGULAR MATRIX MULTIPLICATION; REDUCTION; PROOF;
D O I
10.1007/s43670-021-00003-5
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Johnson-Lindenstrauss embeddings are widely used to reduce the dimension and thus the processing time of data. To reduce the total complexity, also fast algorithms for applying these embeddings are necessary. To date, such fast algorithms are only available either for a non-optimal embedding dimension or up to a certain threshold on the number of data points. We address a variant of this problem where one aims to simultaneously embed larger subsets of the data set. Our method follows an approach by Nelson et al. (New constructions of RIP matrices with fast multiplication and fewer rows. In: Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1515-1528, 2014): a subsampled Hadamard transform maps points into a space of lower, but not optimal dimension. Subsequently, a random matrix with independent entries projects to an optimal embedding dimension. For subsets whose size scales at least polynomially in the ambient dimension, the complexity of this method comes close to the number of operations just to read the data under mild assumptions on the size of the data set that are considerably less restrictive than in previous works. We also prove a lower bound showing that subsampled Hadamard matrices alone cannot reach an optimal embedding dimension. Hence, the second embedding cannot be omitted.
引用
收藏
页数:23
相关论文
共 30 条
[1]   Database-friendly random projections: Johnson-Lindenstrauss with binary coins [J].
Achlioptas, D .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) :671-687
[2]  
Ailon N., 2006, STOC'06. Proceedings of the 38th Annual ACM Symposium on Theory of Computing, P557, DOI 10.1145/1132516.1132597
[3]   An Almost Optimal Unrestricted Fast Johnson-Lindenstrauss Transform [J].
Ailon, Nir ;
Liberty, Edo .
ACM TRANSACTIONS ON ALGORITHMS, 2013, 9 (03)
[4]   Fast Dimension Reduction Using Rademacher Series on Dual BCH Codes [J].
Ailon, Nir ;
Liberty, Edo .
DISCRETE & COMPUTATIONAL GEOMETRY, 2009, 42 (04) :615-630
[5]   A Simple Proof of the Restricted Isometry Property for Random Matrices [J].
Baraniuk, Richard ;
Davenport, Mark ;
DeVore, Ronald ;
Wakin, Michael .
CONSTRUCTIVE APPROXIMATION, 2008, 28 (03) :253-263
[6]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[7]   An Improved Lower Bound for Sparse Reconstruction from Subsampled Hadamard Matrices [J].
Blasiok, Jaroslaw ;
Lopatto, Patrick ;
Luh, Kyle ;
Marcinek, Jake ;
Rao, Shravas .
2019 IEEE 60TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2019), 2019, :1564-1567
[8]  
Brgisser P., 1997, Algebraic complexity theory, V315, DOI [10.1007/978-3-662-03338-8, DOI 10.1007/978-3-662-03338-8]
[9]   An elementary proof of a theorem of Johnson and Lindenstrauss [J].
Dasgupta, S ;
Gupta, A .
RANDOM STRUCTURES & ALGORITHMS, 2003, 22 (01) :60-65
[10]  
Foucart S., 2003, A mathematical introduction to compressive sensing