Viable architectures for high-performance computing

被引：2

作者：

Ziavras, SG ^{[1
]}

Wang, Q

Papathanasiou, P

机构：

[1] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA

[2] New Jersey Inst Technol, Dept Comp Sci, Newark, NJ 07102 USA

[3] Dataline Comp Inst, Piraeus 18900, Greece

来源：

COMPUTER JOURNAL | 2003年 / 46卷 / 01期

关键词：

D O I：

10.1093/comjnl/46.1.36

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing interprocessor connection networks are often plagued by poor topological properties that result in large memory latencies for distributed shared-memory (DSM) computers or multicomputers. On the other hand, scalable networks with very good topological properties are often impossible to build because of their prohibitively high very large scale integration (VLSI) (e.g. wiring) complexity. Such a network is the generalized hypercube (GH). The GH supports full connectivity of all of its nodes in each dimension and is characterized by outstanding topological properties. Also, low-dimensional GHs have very large bisection widths. We present here the class of highly-overlapping windows (HOWs) networks, which are capable of lower complexity than GHs, comparable performance and better scalability. HOWs are obtained from GHs by uniformly removing edges to produce feasible systems of lower wiring complexity. Resulting systems contain numerous highly-overlapping GHs of smaller size. The GH, the binary hypercube and the mesh all belong to this new class of interconnections. In practical cases, HOWs have higher bisection width than tori with similar node and channel costs. Also, HOWs have a very large degree of fault tolerance. This paper focuses on 2-D HOW systems. We analyze the hardware cost of HOWs, present graph embeddings and communications algorithms for HOWs, carry out performance comparisons with binary hypercubes and GHs and simulate HOWs under heavy communication loads. Our results show the suitability of HOWs for very-high-performance computing.

引用

页码：36 / 54

页数：19

共 40 条

[1] PERFORMANCE OF MULTICOMPUTER NETWORKS UNDER PIN-OUT CONSTRAINTS
ABRAHAM, S
PADMANABHAN, K
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 12 (03) : 237 - 248
[2] LIMITS ON INTERCONNECTION NETWORK PERFORMANCE
AGARWAL, A
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1991, 2 (04) : 398 - 412
[3] Blue Gene: A vision for protein science using a petaflop supercomputer
Allen, F
Almasi, G
Andreoni, W
Beece, D
Berne, BJ
Bright, A
Brunheroto, J
Cascaval, C
Castanos, J
Coteus, P
Crumley, P
Curioni, A
Denneau, M
Donath, W
Eleftheriou, M
Fitch, B
Fleischer, B
Georgiou, CJ
Germain, R
Giampapa, M
Gresh, D
Gupta, M
Haring, R
Ho, H
Hochschild, P
Hummel, S
Jonas, T
Lieber, D
Martyna, G
Maturu, K
Moreira, J
Newns, D
Newton, M
Philhower, R
Picunko, T
Pitera, J
Pitman, M
Rand, R
Royyuru, A
Salapura, V
Sanomiya, A
Shah, R
Sham, Y
Singh, S
Snir, M
Suits, F
Swetz, R
Swope, WC
Vishnumurthy, N
Ward, TJC
[J]. IBM SYSTEMS JOURNAL, 2001, 40 (02) : 310 - 327
[4] [Anonymous], 1994, PARALLEL ALGORITHMS
[5] Antonio J. K., 1993, Proceedings of Seventh International Parallel Processing Symposium (Cat. No.93TH0513-2), P387, DOI 10.1109/IPPS.1993.262914
[6] Broadcasting multiple messages in the multiport model
Bar-Noy, A
Ho, CT
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (05) : 500 - 508
[7] BHUYAN LN, 1984, IEEE T COMPUT, V33, P323, DOI 10.1109/TC.1984.1676437
[8] Culler DavidE., 1999, PARALLEL COMPUTER AR
[9] DALLY W, 1990, VLSI PARALLEL COMPUT, P140
[10] PERFORMANCE ANALYSIS OF K-ARY N-CUBE INTERCONNECTION NETWORKS
DALLY, WJ
[J]. IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (06) : 775 - 785

← 1 2 3 4 →