Unsupervised learning of Dirichlet process mixture models with missing data

被引:0
作者
Xunan ZHANG [1 ]
Shiji SONG [1 ]
Lei ZHU [2 ]
Keyou YOU [1 ]
Cheng WU [1 ]
机构
[1] Department of Automation, Tsinghua University
[2] China Ocean Mineral Resources R&D Association
基金
中国国家自然科学基金;
关键词
Dirichlet processes; missing data; clustering; variational Bayesian; image analysis;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
This study presents a novel approach to unsupervised learning for clustering with missing data.We first extend a finite mixture model to the infinite case by considering Dirichlet process mixtures, which can automatically determine the number of mixture components or clusters. Furthermore, we view the missing features as latent variables and compute the posterior distributions using the variational Bayesian expectation maximization algorithm, which optimizes the evidence lower bound on the complete-data log marginal likelihood. We demonstrate the performance on several artificial data sets with missing values. The experimental results indicate that the proposed method outperforms some classic imputation methods. We finally present an application to seabed hydrothermal sulfide color images analysis problem.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 50 条
  • [31] Unsupervised learning of correlated multivariate Gaussian mixture models using MML
    Agusta, Y
    Dowe, DL
    AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 477 - 489
  • [32] Online damage detection of cutting tools using Dirichlet process mixture models?
    Wickramarachchi, Chandula T.
    Rogers, Timothy J.
    McLeay, Thomas E.
    Leahy, Wayne
    Cross, Elizabeth J.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 180
  • [33] Dirichlet Process Mixture Models made Scalable and Effective by means of Massive Distribution
    Meguelati, Khadidja
    Fontez, Benedicte
    Hilgert, Nadine
    Masseglia, Florent
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 502 - 509
  • [34] Unsupervised learning of mixture models based on swarm intelligence and neural networks with optimal completion using incomplete data
    Abas, Ahmed R.
    EGYPTIAN INFORMATICS JOURNAL, 2012, 13 (02) : 103 - 109
  • [35] A hybrid SEM algorithm for high-dimensional unsupervised learning using a finite generalized dirichlet mixture
    Bouguila, Nizar
    Ziou, Djemel
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (09) : 2657 - 2668
  • [36] Online learning for the Dirichlet process mixture model via weakly conjugate approximation
    Jeong, Kuhwan
    Chae, Minwoo
    Kim, Yongdai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 179
  • [37] Missing Data Reconstruction Using Gaussian Mixture Models for Fingerprint Images
    Agaian, Sos S.
    Yeole, Rushikesh D.
    Rao, Shishir P.
    Mulawka, Marzena
    Troy, Mike
    Reinecke, Gary
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2016, 2016, 9869
  • [38] Pattern mixture models for clinical validation of biomarkers in the presence of missing data
    Gao, Fei
    Dong, Jun
    Zeng, Donglin
    Rong, Alan
    Ibrahim, Joseph G.
    STATISTICS IN MEDICINE, 2017, 36 (19) : 2994 - 3004
  • [39] Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers
    Zhou, Yijia
    Gallivan, Kyle A.
    Barbu, Adrian
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024,
  • [40] Dirichlet process mixture models for single-cell RNA-seq clustering
    Adossa, Nigatu A.
    Rytkonen, Kalle T.
    Elo, Laura L.
    BIOLOGY OPEN, 2022, 11 (04):