Minimizing I/Os in Out-of-Core Task Tree Scheduling

被引:1
|
作者
Marchal, Loris [1 ,2 ]
McCauley, Samuel [3 ]
Simon, Bertrand [1 ,2 ]
Vivien, Frederic [1 ,2 ]
机构
[1] ENS Lyon, INRIA, CNRS, 46 Allee Italie, Lyon, France
[2] Univ Lyon, LIP, ENS Lyon, 46 Allee Italie, Lyon, France
[3] IT Univ Copenhagen, Rued Langgards Vej 7, DK-2300 Copenhagen S, Denmark
关键词
D O I
10.1109/IPDPSW.2017.58
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications are usually described as directed acyclic graphs, where nodes represent tasks and edges represent dependencies between tasks. For some applications, such as the multifrontal method of sparse matrix factorization, this graph is a tree: each task produces a single output data, used by a single task (its parent in the tree). We focus on the case when the data manipulated by tasks have a large size, which is especially the case in the multifrontal method. To process a task, both its inputs and its output must fit in the main memory. Moreover, output results of tasks have to be stored between their production and their use by the parent task. It may therefore happen, during an execution, that not all data fit together in memory. In particular, this is the case if the total available memory is smaller than the minimum memory required to process the whole tree. In such a case, some data have to be temporarily written to disk and read afterwards. These Input/Output (I/O) operations are very expensive; hence, the need to minimize them. We revisit this open problem in this paper. Specifically, our goal is to minimize the total volume of I/O while processing a given task tree. We first formalize and generalize known results, then prove that existing solutions can be arbitrarily worse than optimal. Finally, we propose a novel heuristic algorithm, based on the optimal tree traversal for memory minimization. We demonstrate good performance of this new heuristic through simulations on both synthetic trees and realistic trees built from actual sparse matrices.
引用
收藏
页码:884 / 893
页数:10
相关论文
共 50 条
  • [31] Out-of-Core Parallel Frontier Search with MapReduce
    Reinefeld, Alexander
    Schuett, Thorsten
    HIGH PERFORMANCE COMPUTING SYSTEMS AND APPLICATIONS, 2010, 5976 : 323 - 336
  • [32] Out-of-core SVD performance for document indexing
    Martin, Dian I.
    Martin, John C.
    Berry, Michael W.
    Browne, Murray
    APPLIED NUMERICAL MATHEMATICS, 2007, 57 (11-12) : 1230 - 1239
  • [33] Out-of-core remeshing of large polygonal meshes
    Ahn, Minsu
    Guskov, Igor
    Lee, Seungyong
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) : 1221 - 1228
  • [34] Out-of-core wavefront computations with reduced synchronization
    Clauss, Pierre-Nicolas
    Gustedt, Jens
    Suter, Frederic
    PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 293 - +
  • [35] Out-of-core rendering of large, unstructured grids
    Farias, R
    Silva, CT
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (04) : 42 - 50
  • [36] On the performance of parallel factorization of out-of-core matrices
    Caron, E
    Utard, G
    PARALLEL COMPUTING, 2004, 30 (03) : 357 - 375
  • [37] Out-of-Core Solution of Eigenproblems for Macromolecular Simulations
    Aliaga, Jose I.
    Davidovic, Davor
    Quintana-Orti, Enrique S.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 490 - 499
  • [38] The design of a new out-of-core multifrontal solver
    Reid, John K.
    Scott, Jennifer A.
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2007, 4699 : 598 - +
  • [39] Towards Out-of-core Neural Networks on Microcontrollers
    Miao, Hongyu
    Lin, Felix Xiaozhu
    2022 IEEE/ACM 7TH SYMPOSIUM ON EDGE COMPUTING (SEC 2022), 2022, : 1 - 13
  • [40] Out-of-core Algorithms for Binary Partition Hierarchies
    Josselin Lefèvre
    Jean Cousty
    Benjamin Perret
    Harold Phelippeau
    Journal of Mathematical Imaging and Vision, 2025, 67 (2)