Estimation of Gaussian mixture models via tensor moments with application to online learning

被引:1
作者
Rahmani, Donya [1 ]
Niranjan, Mahesan [2 ]
Fay, Damien [3 ]
Takeda, Akiko [4 ]
Brodzki, Jacek [1 ]
机构
[1] Univ Southampton, Sch Math, Southampton, Hants, England
[2] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England
[3] Logicblox Infor, Dept Analyt, Atlanta, GA USA
[4] Univ Tokyo, Dept Creat Informat, Tokyo, Japan
基金
英国工程与自然科学研究理事会;
关键词
Method of moments; Alternating gradient descent; Online learning; Tensor analysis; NUMERICAL OPTIMIZATION; EM; DECOMPOSITIONS; FACTORIZATION;
D O I
10.1016/j.patrec.2020.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an alternating gradient descent algorithm for estimating parameters of a spherical Gaussian mixture model by the method of moments (AGD-MoM). We formulate the problem as a constrained optimisation problem which simultaneously matches the third order moments from the data, represented as a tensor, and the second order moment, which is the empirical covariance matrix. We derive the necessary gradients (and second derivatives), and use them to implement alternating gradient search to estimate the parameters of the model. We show that the proposed method is applicable in both a batch as well as in a streaming (online) setting. Using synthetic and benchmark datasets, we demonstrate empirically that the proposed algorithm outperforms the more classical algorithms like Expectation Maximisation and variational Bayes. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:285 / 292
页数:8
相关论文
共 34 条
  • [1] Multivariate Regression with Incremental Learning of Gaussian Mixture Models
    Acevedo-Valle, Juan M.
    Trejo, Karla
    Angulo, Cecilio
    [J]. RECENT ADVANCES IN ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2017, 300 : 196 - 205
  • [2] Anandkumar A, 2014, J MACH LEARN RES, V15, P2773
  • [3] [Anonymous], 2004, Monte Carlo methods
  • [4] [Anonymous], 2002, SIGNALSANDSYSTEMS
  • [5] [Anonymous], [No title captured]
  • [6] Chaudhuri CR08b] Kamalika, 2008, COLT, P9
  • [7] Chen Z, 2007, CORRELATIVE LEARNING: A BASIS FOR BRAIN AND ADAPTIVE SYSTEMS, P1
  • [8] Dasgupta S, 2007, J MACH LEARN RES, V8, P203
  • [9] On the best rank-1 and rank-(R1,R2,...,RN) approximation of higher-order tensors
    De Lathauwer, L
    De Moor, B
    Vandewalle, J
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2000, 21 (04) : 1324 - 1342
  • [10] Declercq A, 2008, VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, P605