Unsupervised learning of Dirichlet process mixture models with missing data

被引:0
作者
Xunan ZHANG [1 ]
Shiji SONG [1 ]
Lei ZHU [2 ]
Keyou YOU [1 ]
Cheng WU [1 ]
机构
[1] Department of Automation, Tsinghua University
[2] China Ocean Mineral Resources R&D Association
基金
中国国家自然科学基金;
关键词
Dirichlet processes; missing data; clustering; variational Bayesian; image analysis;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
This study presents a novel approach to unsupervised learning for clustering with missing data.We first extend a finite mixture model to the infinite case by considering Dirichlet process mixtures, which can automatically determine the number of mixture components or clusters. Furthermore, we view the missing features as latent variables and compute the posterior distributions using the variational Bayesian expectation maximization algorithm, which optimizes the evidence lower bound on the complete-data log marginal likelihood. We demonstrate the performance on several artificial data sets with missing values. The experimental results indicate that the proposed method outperforms some classic imputation methods. We finally present an application to seabed hydrothermal sulfide color images analysis problem.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 50 条
  • [41] Human Action Recognition using Accelerated Variational Learning of Infinite Dirichlet Mixture Models
    Fan, Wentao
    Sallay, Hassen
    Bouguila, Nizar
    Du, Ji-Xiang
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 451 - 456
  • [42] A hierarchical Dirichlet process mixture of generalized Dirichlet distributions for feature selection
    Fan, Wentao
    Sallay, Hassen
    Bouguila, Nizar
    Bourouis, Sami
    COMPUTERS & ELECTRICAL ENGINEERING, 2015, 43 : 48 - 65
  • [43] Dirichlet Process Log Skew-Normal Mixture with a Missing-at-Random-Covariate in Insurance Claim Analysis
    Kim, Minkun
    Lindberg, David
    Crane, Martin
    Bezbradica, Marija
    ECONOMETRICS, 2023, 11 (04)
  • [44] Research on dirichlet process mixture model for clustering
    Zhang B.
    Zhang K.
    Zhong L.
    Zhang X.
    Ingenierie des Systemes d'Information, 2019, 24 (02): : 183 - 189
  • [45] Accommodating missing data in mixture models for classification by opinion-changing behavior
    Hill, JL
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2001, 26 (02) : 233 - 268
  • [46] Gaussian Scale Mixture Models for Robust Linear Multivariate Regression with Missing Data
    Ala-Luhtala, Juha
    Piche, Robert
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (03) : 791 - 813
  • [47] Application of Pattern Mixture Models to Address Missing Data in Longitudinal Data Analysis Using SPSS
    Son, Heesook
    Friedmann, Erika
    Thomas, Sue A.
    NURSING RESEARCH, 2012, 61 (03) : 195 - 203
  • [48] Characterizing daily physical activity patterns with unsupervised learning via functional mixture models
    Ensari, Ipek
    Caceres, Billy A.
    Jackman, Kasey B.
    Goldsmith, Jeff
    Suero-Tejeda, Niurka M.
    Odlum, Michelle L.
    Bakken, Suzanne
    JOURNAL OF BEHAVIORAL MEDICINE, 2025, 48 (01) : 149 - 161
  • [49] Clustering and finding the number of clusters by unsupervised learning of mixture models using vector quantization
    Yoon, Sangho
    Gray, Robert M.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 1081 - +
  • [50] Mining Numbers in Text Using Suffix Arrays and Clustering Based on Dirichlet Process Mixture Models
    Yoshida, Minoru
    Sato, Issei
    Nakagawa, Hiroshi
    Terada, Akira
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 : 230 - +