An efficient algorithm for out-of-core matrix transposition

被引:16
|
作者
Suh, J
Prasanna, VK
机构
[1] USC, Inst Sci Informat, Arlington, VA 22203 USA
[2] Univ So Calif, Dept Elect Engn Syst, Los Angeles, CA 90089 USA
关键词
matrix transpose; data transfer time; index computation time; I/O time; out-of-core; execution time;
D O I
10.1109/12.995452
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient transposition of Out-of-core matrices has been widely studied. These efforts have focused on reducing the number of I/O operations. However, in state-of-the-art architectures, memory-memory data transfer time and index computation time are also significant components of the overall time. In this paper, we propose an algorithm that considers the index computation time and the I/O time and reduces the overall execution time. Our algorithm reduces the total execution time by reducing the number of I/O operations and eliminating the index computation. In doing so, two techniques are employed: writing the data onto disk in predefined patterns and balancing the number of disk read and write operations. The index computation time, which is an expensive operation involving two divisions and a multiplication, is eliminated by partitioning the memory into read and write buffers, The expensive in-processor permutation is replaced by data collection from the read buffer to the write buffer. Even though this partitioning may increase the number of I/O operations for some cases, it results in an overall reduction in the execution time due to the elimination of the expensive index computation. Our algorithm is analyzed using the well-known Linear Model and the Parallel Disk Model. The experimental results on Sun Enterprise, SGI R12000, and Pentium III show that our algorithm reduces the overall execution time by up to 50 percent compared with the best known algorithms in the literature.
引用
收藏
页码:420 / 438
页数:19
相关论文
共 50 条
  • [21] MultiLogVC: Efficient Out-of-Core Graph Processing Framework for Flash Storage
    Matam, Kiran Kumar
    Hashemi, Hanieh
    Annavaram, Murali
    2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 245 - 255
  • [22] Highly Efficient Parallel Schemes Using Out-of-Core Solver for MoM
    Zhang, Yu
    Sarkar, Tapan K.
    Ghosh, Prasanta
    Taylor, Mary
    De, Arijit
    2007 IEEE APPLIED ELECTROMAGNETICS CONFERENCE, 2007, : 187 - 190
  • [23] An efficient method for very large scale out-of-core terrain visualization
    Zhang, Huijie
    Sun, Jigui
    Yu, Haihong
    Qi, Changsong
    ICAT 2006: 16TH INTERNATIONAL CONFERENCE ON ARTIFICIAL REALITY AND TELEXISTENCE - WORSHOPS, PROCEEDINGS, 2006, : 36 - 41
  • [24] Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver
    Krishnan, S
    Krishnamoorthy, S
    Baumgartner, G
    Lam, CC
    Ramanujam, J
    Sadayappan, P
    Choppella, V
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, 66 (05) : 659 - 673
  • [25] Kaleido: An Efficient Out-of-core Graph Mining System on A Single Machine
    Zhao, Cheng
    Zhang, Zhibin
    Xu, Peng
    Zheng, Tianqi
    Guo, Jiafeng
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 673 - 684
  • [26] Efficient out-of-core algorithms for linear relaxation using blocking covers
    Leiserson, CE
    Rao, S
    Toledo, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 54 (02) : 332 - 344
  • [27] Efficient out-of-core rendering of complex 3D scenes
    Gao, Yu
    Deng, Baosong
    Wei, Yingmei
    Wu, Lingda
    TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2006, 3942 : 883 - 892
  • [28] A Highly Efficient I/O-based Out-of-Core Stencil Algorithm with Globally Optimized Temporal Blocking
    Midorikawa, Hiroko
    Tan, Hideyuki
    2017 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2017,
  • [29] OUT-OF-CORE SOLUTION OF LINEAR EQUATIONS WITH NON-SYMMETRIC COEFFICIENT MATRIX
    HASBANI, Y
    ENGELMAN, M
    COMPUTERS & FLUIDS, 1979, 7 (01) : 13 - 31
  • [30] Out-of-core segmentation by deformable models
    Giraldi, G
    Schaefer, L
    Farias, R
    Silva, R
    FUZZY LOGIC AND APPLICATIONS, 2006, 2955 : 216 - 223