From Conditional Independence to Parallel Execution in Hierarchical Models

被引:1
作者
Nemeth, Balazs [1 ]
Haber, Tom [1 ,2 ]
Liesenborgs, Jori [1 ]
Lamotte, Wim [1 ]
机构
[1] Hasselt Univ tUL, Expertise Ctr Digital Media, Martelarenlaan 42, B-3500 Hasselt, Belgium
[2] IMEC, Exasci Lab, Kapeldreef 75, B-3001 Leuven, Belgium
来源
COMPUTATIONAL SCIENCE - ICCS 2020, PT I | 2020年 / 12137卷
关键词
High performance computing; Descriptive language; Probabilistic modelling; Automatic parallelization; Dataflow; Hierarchical models;
D O I
10.1007/978-3-030-50371-0_12
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Hierarchical models describe phenomena by grouping data into multiple levels. Due to the size of these models, parallel execution is required to avoid prohibitively long computing time. While it is occasionally possible to specify some of these models using parallel building blocks, this limits expressivity. Therefore, a more general generative specification is preferred. To leverage parallel computing capacity, these specifications can be annotated, but doing so effectively assumes that the modeler has expertise from computer science. This paper outlines how to identify parallel parts automatically by leveraging the conditional independence property in the graphical model extracted from the dataflow graph of model specifications. Computation related to random variables with the same depth in the graphical model are identified as candidates for parallel execution. Since subsequent proposals in the parameter space exploration of the model are clustered together, the results show that the well known longest processing time scheduling heuristic deals adequately with load imbalance. The proposed parallelization is evaluated on two pharmacometrics models, a domain where hierarchical models with load imbalance are common due to the numeric simulation of pharmacokinetics and pharmacodynamics of human subjects. The varying number of measurements taken per subject further exacerbates load imbalance.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 25 条
[1]  
[Anonymous], 2007, HDB SCHEDULING INT H, DOI DOI 10.1007/978-3-540-32220-7
[2]  
Beck M, 1989, CONTROL FLOW DATAFLO
[3]   Julia: A Fresh Approach to Numerical Computing [J].
Bezanson, Jeff ;
Edelman, Alan ;
Karpinski, Stefan ;
Shah, Viral B. .
SIAM REVIEW, 2017, 59 (01) :65-98
[4]  
Carey V., 2001, MIXED EFFECTS MODELS, DOI [10.1007/b98882, DOI 10.1007/B98882]
[5]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[6]   EXPLAINING THE GIBBS SAMPLER [J].
CASELLA, G ;
GEORGE, EI .
AMERICAN STATISTICIAN, 1992, 46 (03) :167-174
[7]  
Chakravarthi V.S, 2020, PRACTICAL APPROACH V, P173, DOI 10.1007/978-3-030-23049-4
[8]  
Christoph Kessler e., 2007, PROCEEDINGS, PARS, pag, P13
[9]  
CULLER A, 1986, ANNU REV COMPUT SCI, V1, P225
[10]   Dynamic population pharmacokinetic-pharmacodynamic modelling and simulation supports similar efficacy in glycosylated haemoglobin response with once or twice-daily dosing of canagliflozin [J].
de Winter, Willem ;
Dunne, Adrian ;
de Trixhe, Xavier Woot ;
Devineni, Damayanthi ;
Hsu, Chyi-Hung ;
Pinheiro, Jose ;
Polidori, David .
BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2017, 83 (05) :1072-1081