Maximum-Likelihood Estimation of Neural Mixture Densities: Model, Algorithm, and Preliminary Experimental Evaluation

被引:3
作者
Trentin, Edmondo [1 ]
机构
[1] Univ Siena, Dipartimento Ingn Informaz & Sci Matemat, Siena, Italy
来源
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2018 | 2018年 / 11081卷
关键词
Density estimation; Mixture density; Unsupervised learning; Constrained learning; Mixture of experts; NETWORKS; CLASSIFICATION; COMPRESSION;
D O I
10.1007/978-3-319-99978-4_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised estimation of probability density functions by means of parametric mixture densities (e.g., Gaussian mixture models) may improve significantly over plain, single-density estimators in terms of modeling capabilities. Moreover, mixture densities (and even mixtures of mixture densities) may be exploited for the statistical description of phenomena whose data distributions implicitly depend on the distinct outcomes of a number of non-observable, latent states of nature. In spite of some recent advances in density estimation via neural networks, no proper mixtures of neural component densities have been investigated so far. The paper proposes a first algorithm for estimating Neural Mixture Densities based on the usual maximum-likelihood criterion, satisfying numerically a combination of hard and soft constraints aimed at ensuring a proper probabilistic interpretation of the resulting model. Preliminary results are presented and their statistical significance is assessed, corroborating the soundness of the approach with respect to established statistical techniques.
引用
收藏
页码:178 / 189
页数:12
相关论文
共 29 条
[1]   An introduction to MCMC for machine learning [J].
Andrieu, C ;
de Freitas, N ;
Doucet, A ;
Jordan, MI .
MACHINE LEARNING, 2003, 50 (1-2) :5-43
[2]  
[Anonymous], 1999, Monte Carlo Methods in Statistical Physics
[3]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[4]   Techniques for dealing with incomplete data: a tutorial and survey [J].
Aste, Marco ;
Boninsegna, Massimo ;
Freno, Antonino ;
Trentin, Edmondo .
PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (01) :1-29
[5]   Wireless Network Compression Via Memory-Enabled Overhearing Helpers [J].
Beirami, Ahmad ;
Sardari, Mohsen ;
Fekri, Faramarz .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (01) :176-190
[6]  
Bishop C. M, 2006, INFORM SCI STAT, DOI [10.1007/978-1-4615-7566-5, DOI 10.1007/978-1-4615-7566-5]
[7]  
Bishop C.M., 1995, Neural networks for pattern recognition
[8]   Recursive Neural Networks for Density Estimation Over Generalized Random Graphs [J].
Bongini, Marco ;
Rigutini, Leonardo ;
Trentin, Edmondo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) :5441-5458
[9]  
Borenstein M, 2009, INTRO METAANALYSIS
[10]  
Cuthbertson K., 2004, QUANTITATIVE FINANCI