Large sample spectral analysis of graph-based multi-manifold clustering

被引:0
作者
Trillos, Nicolas Garcia [1 ]
He, Pengfei [2 ]
Li, Chenghui [1 ]
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI USA
关键词
multi-manifold clustering; graph Laplacian; spectral convergence; manifold learning; discrete to continuum limit; CONVERGENCE; CONSISTENCY; LAPLACIANS; DISTANCES; GEOMETRY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we study statistical properties of graph-based algorithms for multi-manifold clustering (MMC). In MMC the goal is to retrieve the multi-manifold structure underlying a given Euclidean data set when this one is assumed to be obtained by sampling a distribution on a union of manifolds M = M1 boolean OR center dot center dot center dot boolean OR MN that may intersect with each other and that may have different dimensions. We investigate sufficient conditions that similarity graphs on data sets must satisfy in order for their corresponding graph Laplacians to capture the right geometric information to solve the MMC problem. Precisely, we provide high probability error bounds for the spectral approximation of a tensorized Laplacian on M with a suitable graph Laplacian built from the observations; the recovered tensorized Laplacian contains all geometric information of all the individual underlying manifolds. We provide an example of a family of similarity graphs, which we call annular proximity graphs with angle constraints, satisfying these sufficient conditions. We contrast our family of graphs with other constructions in the literature based on the alignment of tangent planes. Extensive numerical experiments expand the insights that our theory provides on the MMC problem.
引用
收藏
页数:71
相关论文
共 66 条
[1]   Stability and Minimax Optimality of Tangential Delaunay Complexes for Manifold Reconstruction [J].
Aamari, Eddie ;
Levrard, Clement .
DISCRETE & COMPUTATIONAL GEOMETRY, 2018, 59 (04) :923-971
[2]  
Ahmed M., 2015, ACM Transactions on Spatial Algorithms and Systems, V1, P1
[3]  
[Anonymous], 2014, Advances in Neural Information Processing Systems (NIPS)
[4]  
Arias-Castro E, 2017, J MACH LEARN RES, V18, P1
[5]   Spectral clustering based on local linear approximations [J].
Arias-Castro, Ery ;
Chen, Guangliang ;
Lerman, Gilad .
ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 :1537-1587
[6]   Clustering Based on Pairwise Distances When the Data is of Mixed Dimensions [J].
Arias-Castro, Ery .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (03) :1692-1706
[7]  
Aubin T., 1998, Some Nonlinear Problems in Riemannian Geometry
[8]  
Babaeian A, 2018, Arxiv, DOI arXiv:1812.02327
[9]   Nonlinear subspace clustering using curvature constrained distances [J].
Babaeian, Amir ;
Babaee, Mohammadreaza ;
Bayestehtashk, Alireza ;
Bandarabadi, Mojtaba .
PATTERN RECOGNITION LETTERS, 2015, 68 :118-125
[10]   Towards a theoretical foundation for Laplacian-based manifold methods [J].
Belkin, M ;
Niyogi, P .
LEARNING THEORY, PROCEEDINGS, 2005, 3559 :486-500