Negative binomial graphical model with excess zeros

被引:1
作者
Park, Beomjin [1 ]
Choi, Hosik [2 ]
Park, Changyi [1 ]
机构
[1] Univ Seoul, Dept Stat, Seoul, South Korea
[2] Univ Seoul, Grad Sch, Dept Urban Big Data Convergence, Seoul, South Korea
关键词
count data; Markov random field; over-dispersion; undirected graphical model; zero inflation; VARIABLE SELECTION; EM ALGORITHM; POISSON; NETWORKS; DEMAND;
D O I
10.1002/sam.11536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Markov random field or undirected graphical models (GM) are a popular class of GM useful in various fields because they provide an intuitive and interpretable graph expressing the complex relationship between random variables. The zero-inflated local Poisson graphical model has been proposed as a graphical model for count data with excess zeros. However, as count data are often characterized by over-dispersion, the local Poisson graphical model may suffer from a poor fit to data. In this paper, we propose a zero-inflated local negative binomial (NB) graphical model. Due to the dependencies of parameters in our models, a direct optimization of the objective function is difficult. Instead, we devise expectation-minimization algorithms based on two different parametrizations for the NB distribution. Through a simulation study, we illustrate the effectiveness of our method for learning network structure from over-dispersed count data with excess zeros. We further apply our method to real data to estimate its network structure.
引用
收藏
页码:449 / 465
页数:17
相关论文
共 50 条
[1]   A Local Poisson Graphical Model for Inferring Networks From Sequencing Data [J].
Allen, Genevera I. ;
Liu, Zhandong .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2013, 12 (03) :189-198
[2]  
BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
[3]  
Binder H, 2008, STAT APPL GENET MOL, V7
[4]   FITTING THE NEGATIVE BINOMIAL DISTRIBUTION TO BIOLOGICAL DATA - NOTE ON THE EFFICIENT FITTING OF THE NEGATIVE BINOMIAL [J].
BLISS, CI ;
FISHER, RA .
BIOMETRICS, 1953, 9 (02) :176-200
[5]   Scale-free networks are rare [J].
Broido, Anna D. ;
Clauset, Aaron .
NATURE COMMUNICATIONS, 2019, 10 (1)
[6]  
BROOKS RJ, 1984, J R STAT SOC C-APPL, V33, P285
[7]  
Bühlmann P, 2002, ANN STAT, V30, P927
[8]   Network analysis for count data with excess zeros [J].
Choi, Hosik ;
Gim, Jungsoo ;
Won, Sungho ;
Kim, You Jin ;
Kwon, Sunghoon ;
Park, Changyi .
BMC GENETICS, 2017, 18
[9]  
Classic3 Dataset, 2009, CLASSIC3 DAT NOV 29