ParticleMDI: particle Monte Carlo methods for the cluster analysis of multiple datasets with applications to cancer subtype identification

被引:0
|
作者
Nathan Cunningham
Jim E. Griffin
David L. Wild
机构
[1] University of Warwick Coventry,Department of Statistics
[2] University College London,undefined
来源
Advances in Data Analysis and Classification | 2020年 / 14卷
关键词
Cluster analysis; Mixture models; Bayesian inference; Particle Monte Carlo; 62H30; 62P10; 65C05;
D O I
暂无
中图分类号
学科分类号
摘要
We present a novel nonparametric Bayesian approach for performing cluster analysis in a context where observational units have data arising from multiple sources. Our approach uses a particle Gibbs sampler for inference in which cluster allocations are jointly updated using a conditional particle filter within a Gibbs sampler, improving the mixing of the MCMC chain. We develop several approaches to improving the computational performance of our algorithm. These methods can achieve greater than an order-of-magnitude improvement in performance at no cost to accuracy and can be applied more broadly to Bayesian inference for mixture models with a single dataset. We apply our algorithm to the discovery of risk cohorts amongst 243 patients presenting with kidney renal clear cell carcinoma, using samples from the Cancer Genome Atlas, for which there are gene expression, copy number variation, DNA methylation, protein expression and microRNA data. We identify 4 distinct consensus subtypes and show they are prognostic for survival rate (p<0.0001\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p < 0.0001$$\end{document}).
引用
收藏
页码:463 / 484
页数:21
相关论文
共 2 条
  • [1] ParticleMDI: particle Monte Carlo methods for the cluster analysis of multiple datasets with applications to cancer subtype identification
    Cunningham, Nathan
    Griffin, Jim E.
    Wild, David L.
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2020, 14 (02) : 463 - 484
  • [2] particleMDI: A Julia Package for the Integrative Cluster Analysis of Multiple Datasets
    Cunningham, Nathan
    Griffin, Jim E.
    Wild, David L.
    Lee, Anthony
    BAYESIAN STATISTICS AND NEW GENERATIONS, BAYSM 2018, 2019, 296 : 65 - 74