A new method of moments for latent variable models

被引:1
作者
Ruffini, Matted [1 ,2 ]
Casanellas, Marta [1 ,2 ]
Gayada, Ricard [1 ,2 ]
机构
[1] Univ Politecn Cataluna, Barcelona, Spain
[2] BGSMath, Barcelona, Spain
关键词
Spectral methods; Method of moments; Latent variable models; Topic modeling; TENSOR DECOMPOSITIONS; ALGORITHMS; RANK;
D O I
10.1007/s10994-018-5706-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm for the unsupervised learning of latent variable models based on the method of moments. We give efficient estimates of the moments for two models that are well known, e.g., in text mining, the single-topic model and latent Dirichlet allocation, and we provide a tensor decomposition algorithm for the moments that proves to be robust both in theory and in practice. Experiments on synthetic data show that the proposed estimators outperform the existing ones in terms of reconstruction accuracy, and that the proposed tensor decomposition technique achieves the learning accuracy of the state-of-the-art method with significantly smaller running times. We also provide examples of applications to real-world text corpora for both single-topic model and LDA, obtaining meaningful results.
引用
收藏
页码:1431 / 1455
页数:25
相关论文
共 46 条
  • [1] Alighieri Dante., 1979, LA DIVINA COMMEDIA
  • [2] Anandkumar A., 2012, ADV NEURAL INFORM PR, V25
  • [3] Anandkumar A, 2014, J MACH LEARN RES, V15, P2773
  • [4] Anandkumar Animashree, 2012, C LEARN THEOR COLT
  • [5] [Anonymous], 1990, Matrix Perturbation Theory, Computer Science and Scientific Computing
  • [6] [Anonymous], 1970, UCLA Working Papers in Phonetics, DOI DOI 10.1134/S0036023613040165
  • [7] [Anonymous], 2013, ADV NEURAL INF PROCE
  • [8] [Anonymous], 1928, J MATH PHYS, DOI DOI 10.1002/SAPM19287139
  • [9] [Anonymous], 2017, Machine Learning for Healthcare Conference
  • [10] [Anonymous], 2013, P 4 C INNOVATIONS TH