CGMgraph/CGMlib: Implementing and testing cgm graph algorithms on PC clusters and shared memory machines

被引：26

作者：

Chan, A ^{[1
]}

Dehne, F

Taylor, R

机构：

[1] Fayetteville State Univ, Dept Math & Comp Sci, Fayetteville, NC 28301 USA

[2] Griffith Univ, Sch ICT, Brisbane, Qld 4111, Australia

[3] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada

来源：

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2005年 / 19卷 / 01期

关键词：

coarse-grained multiprocessor; graph algorithms; library; parallel algorithms implementation; processor cluster;

D O I：

10.1177/1094342005051196

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we present CGMgraph, the first integrated library of parallel graph methods for PC clusters based on Coarse Grained Multicomputer (CGM) algorithms. CGMgraph implements parallel methods for various graph problems. Our implementations of deterministic list ranking, Euler tour, connected components, spanning forest, and bipartite graph detection are, to our knowledge, the first efficient implementations for PC clusters. Our library also includes CGMlib, a library of basic CGM tools such as sorting, prefix sum, one-to-all broadcast, all-to-one gather, h-Relation, all-to-all broadcast, array balancing, and CGM partitioning. Both libraries are available for download at http: //www. scs. carleton. ca/similar tocgm. In the experimental part of this paper, we demonstrate the performance of our methods on four different architectures: a gigabit connected high performance PC cluster, a smaller PC cluster connected via fast ethernet, a network of workstations, and a shared memory machine. Our experiments show that our library provides good parallel speedup and scalability on all four platforms. The communication overhead is, in most cases, small and does not grow significantly with an increasing number of processors. This is a very important feature of CGM algorithms which makes them very efficient in practice.

引用

页码：81 / 97

页数：17

共 16 条

[1]

CACERE E, 2000, LECT NOTES COMPUTER, V1928, P83

[2]

Chan A, 2003, LECT NOTES COMPUT SC, V2840, P117

[3]

Chan A., 1999, Parallel Processing Letters, V9, P533, DOI 10.1142/S0129626499000499

[4]

CHAN A, 1998, TR9806 CARL U SCH CO

[5] Efficient parallel graph algorithms for coarse-grained multicomputers and BSP [J].

Dehne, F ;

Ferreira, A ;

Cáceres, E ;

Song, SW ;

Roncato, A .

ALGORITHMICA, 2002, 33 (02) :183-200

[6]

DEHNE F, 1996, AS COMP SCI C ASIAN, P1

[7]

DEHNE F, 1993, ACM S COMP GEOM, P298

[8]

GOODRICH MT, 1996, P 28 ANN ACM S THEOR

[9]

LASSOUS IG, 2000, RR3885 INRIA

[10]

REIDMILLER M, 1994, ACM S PAR ALG ARCH, P104

← 1 2 →