Multi-scale geometric methods for data sets II: Geometric Multi-Resolution Analysis

被引:79
作者
Allard, William K. [1 ]
Chen, Guangliang [1 ]
Maggioni, Mauro [1 ,2 ]
机构
[1] Duke Univ, Dept Math, Durham, NC 27708 USA
[2] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
Multi-scale analysis; Geometric wavelets; High-dimensional data sets; Frames; Sparse approximation; Dictionary learning; STRUCTURE DEFINITION; INTRINSIC DIMENSION; HARMONIC-ANALYSIS; DECOMPOSITION; DIFFUSIONS; LAPLACIAN; TOOL;
D O I
10.1016/j.acha.2011.08.001
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Data sets are often modeled as samples from a probability distribution in R-D, for D large. It is often assumed that the data has some interesting low-dimensional structure, for example that of a d-dimensional manifold M, with d much smaller than D. When M is simply a linear subspace, one may exploit this assumption for encoding efficiently the data by projecting onto a dictionary of d vectors in R-D (for example found by SVD), at a cost (n + D)d for n data points. When M is nonlinear, there are no "explicit" and algorithmically efficient constructions of dictionaries that achieve a similar efficiency: typically one uses either random dictionaries, or dictionaries obtained by black-box global optimization. In this paper we construct data-dependent multi-scale dictionaries that aim at efficiently encoding and manipulating the data. Their construction is fast, and so are the algorithms that map data points to dictionary coefficients and vice versa, in contrast with L-1-type sparsity-seeking algorithms, but like adaptive nonlinear approximation in classical multi-scale analysis. In addition, data points are guaranteed to have a compressible representation in terms of the dictionary, depending on the assumptions on the geometry of the underlying probability distribution. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:435 / 462
页数:28
相关论文
共 69 条
[1]  
Aharon M., 2005, PROC SPARS, V5, P9, DOI DOI 10.1109/TSP.2006.881199
[2]  
[Anonymous], 2016, Appl. Numer. Harmon. Anal
[3]  
[Anonymous], 200227 STANF U DEP S
[4]  
[Anonymous], ADV NEURAL INF PROCE
[5]  
Baraniuk R., RANDOM PROJECTIONS S
[6]   Probabilistic approximation of metric spaces and its algorithmic applications [J].
Bartal, Y .
37TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1996, :184-193
[7]  
Beygelzimer A, 2006, P 23 INT C MACH LEAR, P97, DOI DOI 10.1145/1143844.1143857
[8]  
Binev P, 2005, J MACH LEARN RES, V6, P1297
[9]   Fast computation in adaptive tree approximation [J].
Binev, P ;
DeVore, R .
NUMERISCHE MATHEMATIK, 2004, 97 (02) :193-217
[10]   Fast high-dimensional approximation with sparse occupancy trees [J].
Binev, Peter ;
Dahmen, Wolfgang ;
Lamby, Philipp .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 235 (08) :2063-2076