ODD MEMORY-SYSTEMS - A NEW APPROACH

被引:2
作者
SEZNEC, A
LENFANT, J
机构
[1] IRISA, 35042 Rennes Cedex, Campus de Beaulieu
关键词
D O I
10.1006/jpdc.1995.1063
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To reject the use of a prime (or odd) number N of memory banks in a vector processor, it is generally advanced that address computation for such a memory system would require systematic Euclidean division by the number N. We first show that the Chinese Remainder Theorem allows one to define a very simple mapping of data onto the memory banks for which address computation does not require any Euclidean division. Massively parallel SIMD computers may have thousands of processors. When the memory on such a machine is globally shared, routing vectors from memory to the processors is a major difficulty; the control for the interconnection network cannot be generally computed at execution time. When the number of memory banks and processors is a product of prime numbers, the family of permutations needed for routing vectors from memory to the processors through the interconnection network has very specific properties. The Chinese Remainder Network presented in the paper is able to execute all these permutations in a single path and may be easily controlled. (c) 1995 Academic Press, Inc.
引用
收藏
页码:248 / 256
页数:9
相关论文
共 28 条
  • [1] Benes V.E., Mathematical Theory of Connecting Networks and Telephone Traffic, (1968)
  • [2] Budnick P., Kuck D., The organization and use of parallel memories, IEEE Trans. Comput., C20, pp. 1566-1569, (1971)
  • [3] Chor B., Leiserson C.E., Rivest R.L., Shearer J.B., An application of number theory to the organization of raster-graphics memory, J. Arroc. Comput. Mach., 33, pp. 86-104, (1986)
  • [4] Dupont De Dinechin B., A ultra fast Euclidean division algorithm for prime memory systems, Supercomputing, 91, pp. 56-66, (1991)
  • [5] Frailong J.M., Jalby W., Lenfant J., XOR-schemes: A flexible organization in parallel memories, Proc. Of 1985 International Conference on Parallel Processing, pp. 276-283, (1985)
  • [6] Gao Q.S., The Chinese remainder theorem and the prime memory system, Proc. 20Th International Symposium on Computer Architecture, pp. 337-340, (1993)
  • [7] Gajski D., Kuck D., Lawrie D., Sameh A., CEDAR, a large scale multiprocessor, Proc. International Conference on Parallel Processing, pp. 524-529, (1985)
  • [8] Kuck D.J., Stokes R.A., The Burroughs Scientific Processor (BSP), IEEE Trans. Comput., C31, pp. 948-960, (1982)
  • [9] Harper D.T., Jump J.R., Performance evaluation of vector accesses in parallel memories using a skewed storage scheme, Proc. 13th International Symposium on Computer Architecture, pp. 324-328, (1986)
  • [10] Harper D.T., Jump J.R., Vector accesses in parallel memories using a skewed storage scheme, IEEE Trans. Comput., C36, pp. 1440-1449, (1987)