Smoothed Analysis of Tensor Decompositions

被引:63
作者
Bhaskara, Aditya [1 ,5 ]
Charikar, Moses [2 ]
Moitra, Ankur [3 ,6 ]
Vijayaraghavan, Aravindan [4 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
[3] MIT, Cambridge, MA 02139 USA
[4] CMU, Pittsburgh, PA USA
[5] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[6] Inst Adv Study, Princeton, NJ USA
来源
STOC'14: PROCEEDINGS OF THE 46TH ANNUAL 2014 ACM SYMPOSIUM ON THEORY OF COMPUTING | 2014年
关键词
RANK;
D O I
10.1145/2591796.2591881
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Low rank decomposition of tensors is a powerful tool for learning generative models. The uniqueness results that hold for tensors give them a significant advantage over matrices. However, tensors pose serious algorithmic challenges; in particular, much of the matrix algebra toolkit fails to generalize to tensors. Efficient decomposition in the overcomplete case (where rank exceeds dimension) is particularly challenging. We introduce a smoothed analysis model for studying these questions and develop an efficient algorithm for tensor decomposition in the highly overcomplete case (rank polynomial in the dimension). In this setting, we show that our algorithm is robust to inverse polynomial error a crucial property for applications in learning since we are only allowed a polynomial number of samples. While algorithms are known for exact tensor decomposition in some overcomplete settings, our main contribution is in analyzing their stability in the framework of smoothed analysis. Our main technical contribution is to show that tensor products of perturbed vectors are linearly independent in a robust sense (i.e. the associated matrix has singular values that are at least an inverse polynomial). This key result paves the way for applying tensor methods to learning problems in the smoothed setting. In particular, we use it to obtain results for learning multi-view models and mixtures of axis-aligned Gaussians where there are many more "components" than dimensions. The assumption here is that the model is not adversarially chosen, which we formalize by thinking of the model parameters as being perturbed. We believe this an appealing way to analyze realistic instances of learning problems, since this framework allows us to overcome many of the usual limitations of using tensor methods.
引用
收藏
页码:594 / 603
页数:10
相关论文
共 39 条
[1]   IDENTIFIABILITY OF PARAMETERS IN LATENT STRUCTURE MODELS WITH MANY OBSERVED VARIABLES [J].
Allman, Elizabeth S. ;
Matias, Catherine ;
Rhode, John A. .
ANNALS OF STATISTICS, 2009, 37 (6A) :3099-3132
[2]  
Anandkumar A., 2012, ADV NEURAL INFORM PR, P926
[3]  
Anandkumar A., 2012, ARXIV12107559
[4]  
Anandkumar A., 2013, COLT
[5]  
Anandkumar A., COLT
[6]  
[Anonymous], 1985, Matrix Analysis
[7]  
Arora Sanjeev, 2012, NIPS, P2384
[8]  
Belkin M., 2013, COLT
[9]  
BELKIN M, 2010, FOCS, P103, DOI DOI 10.1109/FOCS.2010.16
[10]  
BHASKARA A., 2013, ARXIV13048087