Performance analysis of the parallel Karatsuba multiplication algorithm for distributed memory architectures

被引:9
作者
Cesari, G
Maeder, R
机构
[1] Inst. for Theor. Computer Science, ETH
[2] Universita degli Studi di Trieste, DEEI
关键词
D O I
10.1006/jsco.1996.0026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present three parallel implementations of the Karatsuba algorithm for long integer multiplication on a distributed memory architecture and discuss the experimental results obtained on a Paragon computer. The first two implementations have both time complexity O(n) on n(log2 3) processors, but present different behavior for inputs of practical size. The third algorithm has complexity O(n log(2) n) on n processors, offering therefore better asymptotic efficiency. A refinement of the asymptotic analysis for the important case of a constant number of processors takes into account sequential parts of the algorithm and communications overhead. It is shown that the theoretically best speed-up and efficiency can be obtained with two of the algorithms for sufficient problem size. The experimental results confirm the analysis. (C) 1996 Academic Press Limited
引用
收藏
页码:467 / 473
页数:7
相关论文
共 50 条
  • [31] Karatsuba-ZOT Multiplication Algorithm and Its Application in Cryptography
    Jahani, Shahram
    Samsudin, Azman
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 2417 - 2423
  • [32] Programming effort vs. performance with a hybrid programming model for distributed memory parallel architectures
    Rodman, A
    Brorsson, M
    [J]. EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 888 - 898
  • [33] Performance of parallel FDTD method for shared- and distributed-memory architectures: Application tobioelectromagnetics
    Ruiz-Cabello, Miguel N.
    Angulo, Luis M. Diaz
    Cobos Sanchez, Clemente
    Moglie, Franco
    Garcia, Salvador G.
    [J]. PLOS ONE, 2020, 15 (09):
  • [34] Performance evaluation of or-parallel logic programming systems on distributed shared-memory architectures
    Calegario, VM
    Dutra, ID
    [J]. EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 1484 - 1491
  • [35] PSEUDOSPECTRAL CORRELATION METHODS ON DISTRIBUTED-MEMORY PARALLEL ARCHITECTURES
    MARTINEZ, TJ
    CARTER, EA
    [J]. CHEMICAL PHYSICS LETTERS, 1995, 241 (5-6) : 490 - 496
  • [36] Development and performance analysis of real-world applications for distributed and parallel architectures
    Fahringer, T
    Blaha, P
    Hössinger, A
    Luitz, J
    Mehofer, E
    Moritsch, H
    Scholz, B
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2001, 13 (10) : 841 - 868
  • [37] PARTITIONING AND MAPPING FOR PARALLEL NESTED DISSECTION ON DISTRIBUTED MEMORY ARCHITECTURES
    CHARRIER, P
    ROMAN, J
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1992, 634 : 295 - 306
  • [38] Resource estimation for parallel architectures with distributed processor/memory nodes
    Thornton, M.
    Andrews, D.L.
    [J]. Journal of Computing and Information Technology, 1998, 6 (04): : 359 - 371
  • [39] Mesh partitioning and load balancing for distributed memory parallel architectures
    Walshaw, C
    Cross, M
    Everett, MG
    [J]. PARALLEL AND DISTRIBUTED PROCESSING FOR COMPUTATIONAL MECHANICS: SYSTEMS AND TOOLS, 1997, : 110 - 123
  • [40] A uniform memory model for distributed data objects on parallel architectures
    Balaji, V
    Numrich, RW
    [J]. USE OF HIGH PERFORMANCE COMPUTING IN METEOROLOGY, 2005, : 272 - 294