Multi-GPU Implementation of the Uniformization Method for Solving Markov Models

被引：0

作者：

Karwacki, Marek ^{[1
]}

Bylina, Beata ^{[1
]}

Bylina, Jaroslaw ^{[1
]}

机构：

[1] Marie Curie Sklodowska Univ, Inst Math, Pl M Curie Sklodowskiej 5, PL-20031 Lublin, Poland

来源：

2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS) | 2012年

关键词：

Markovian models; uniformization method; GPU; multi-GPU; heterogeneous computations; parallel computing; wireless network models;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Markovian models can generate very large sparse matrices, which are difficult to store and solve. A useful method for finding transient probabilities in Markovian models is the uniformization. The aim of this paper is to show that the performance of the uniformization can be improved using multi-GPU architecture. We propose partitioning scheme for HYB sparse matrix storage format and some optimization techniques adjusted so as to minimize communication between GPUs during iterative sparse matrix-vector multiplication, which is the most time consuming step. The results of experiments show that on multi-GPU we can solve larger matrices than on single device and accelerate computations in comparison to a multithreaded CPU. Computational test have been carried out in double precision for a wireless network models. Using multi-GPU we were able to solve model which is described by a matrix of the size 3.6 x 10(7).

引用

页码：533 / 537

页数：5

共 11 条

[1]

[Anonymous], 1994, Introduction to the Numerical Solutions of Markov Chains

[2]

Bell N., 2008, Efficient sparse matrix-vector multiplication on CUDA

[3]

Bylina B., 2011, THEOR APPL INFORM, V23, P127

[4]

Bylina B, 2012, COMM COM INF SC, V291, P401, DOI 10.1007/978-3-642-31217-5_42

[5]

Bylina J., CCIS 291 COMP NETW 2, P411

[6]

Bylina J., CCIS 160 COMPUTER NE, P80

[7] Uniformization and hypergraph partitioning for the distributed computation of response time in very large Markov models [J].

Dingle, NJ ;

Harrison, PG ;

Knottenbelt, WJ .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (08) :908-920

[8]

Kreutzer M., 2011, ABS11125588 CORR

[9]

NVIDIA Corporation, 2009, CUDA PROGR GUID

[10] Expokit: A software package for computing matrix exponentials [J].

Sidje, RB .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1998, 24 (01) :130-156

← 1 2 →