Multi-node Expectation-Maximization algorithm for finite mixture models

被引:1
作者
Lee, Sharon X. [1 ]
McLachlan, Geoffrey J. [2 ]
Leemaqz, Kaleb L. [3 ]
机构
[1] Univ Adelaide, Sch Math Sci, Adelaide, SA, Australia
[2] Univ Queensland, Dept Math, Brisbane, Qld, Australia
[3] Univ New South Wales, UNSW Business Sch, Sydney, NSW, Australia
基金
澳大利亚研究理事会;
关键词
EM algorithm; mixture model; parallel computing; MAXIMUM-LIKELIHOOD; SKEW;
D O I
10.1002/sam.11529
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finite mixture models are powerful tools for modeling and analyzing heterogeneous data. Parameter estimation is typically carried out using maximum likelihood estimation via the Expectation-Maximization (EM) algorithm. Recently, the adoption of flexible distributions as component densities has become increasingly popular. Often, the EM algorithm for these models involves complicated expressions that are time-consuming to evaluate numerically. In this paper, we describe a parallel implementation of the EM algorithm suitable for both single-threaded and multi-threaded processors and for both single machine and multiple-node systems. Numerical experiments are performed to demonstrate the potential performance gain in different settings. Comparison is also made across two commonly used platforms-R and MATLAB. For illustration, a fairly general mixture model is used in the comparison.
引用
收藏
页码:297 / 304
页数:8
相关论文
共 31 条
[1]   Skew-normality for climatic data and dispersal models for plant epidemiology: When application fields drive spatial statistics [J].
Allard, D. ;
Soubeyrand, S. .
SPATIAL STATISTICS, 2012, 1 :50-64
[2]  
[Anonymous], 2010, P 9 USENIX C OP SYST
[3]   On fundamental skew distributions [J].
Arellano-Valle, RB ;
Genton, MG .
JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 96 (01) :93-116
[4]   Structural Equation Models and Mixture Models With Continuous Nonnormal Skewed Distributions [J].
Asparouhov, Tihomir ;
Muthen, Bengt .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2016, 23 (01) :1-19
[5]   Multivariate mixture modeling using skew-normal independent distributions [J].
Barbosa Cabral, Celso Romulo ;
Lachos, Victor Hugo ;
Prates, Marcos O. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) :126-142
[6]   Growth estimates of cardinalfish (Epigonus crassicaudus) based on scale mixtures of skew-normal distributions [J].
Contreras-Reyes, Javier E. ;
Arellano-Valle, Reinaldo B. .
FISHERIES RESEARCH, 2013, 147 :137-144
[7]  
Cook R. D., 1994, INTRO REGRESSION GRA
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-t distributions [J].
Fruehwirth-Schnatter, Sylvia ;
Pyne, Saumyadipta .
BIOSTATISTICS, 2010, 11 (02) :317-336
[10]  
Gonzalez Joseph E., 2012, Proceedings of USENIX OSDI, P17, DOI DOI 10.5555/2387880.2387883