Workflow resiliency for large-scale distributed applications

被引:5
作者
Toan Nguyen [1 ]
Desideri, Jean-Antoine [1 ]
Selmin, Vittorio [2 ]
机构
[1] INRIA, Ctr Rech Grenoble Rhone Alpes, FR-38334 Saint Ismier, France
[2] Alenia Aeronaut, I-10146 Turin, Italy
来源
2009 THIRD INTERNATIONAL CONFERENCE ON ADVANCED ENGINEERING COMPUTING AND APPLICATIONS IN SCIENCES (ADVCOMP 2009) | 2009年
关键词
workflows; resiliency; distributed computing; parallel computing; large-scale applications;
D O I
10.1109/ADVCOMP.2009.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale simulation and optimization are demanding applications that require high-performance computing platforms. Because their economic impact is fundamental to the industry, they also require robust, seamless and effective mechanisms to support dynamic user interactions, as well as fault-tolerance and resiliency on parallel computing platforms. Distributed workflows are considered here as a means to support large-scale dynamic and resilient multiphysics simulation and optimization applications, such as multiphysics aircraft simulation.
引用
收藏
页码:7 / +
页数:2
相关论文
共 50 条
[11]   DHPV: a distributed algorithm for large-scale graph partitioning [J].
Wilfried Yves Hamilton Adoni ;
Tarik Nahhal ;
Moez Krichen ;
Abdeltif El byed ;
Ismail Assayad .
Journal of Big Data, 7
[12]   Configuration monitoring tool for large-scale distributed computing [J].
Wu, Y ;
Graham, G ;
Lu, X ;
Afaq, A ;
Kim, BJ ;
Fisk, I .
NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2004, 534 (1-2) :66-69
[13]   Distributed Data Processing for Large-Scale Simulations on Cloud [J].
Lu, Tianjian ;
Hoyer, Stephan ;
Wang, Qing ;
Hu, Lily ;
Chen, Yi-Fan .
2021 JOINT IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY, SIGNAL & POWER INTEGRITY, AND EMC EUROPE (EMC+SIPI AND EMC EUROPE), 2021, :53-58
[14]   A distributed clustering algorithm for large-scale dynamic networks [J].
Thibault Bernard ;
Alain Bui ;
Laurence Pilard ;
Devan Sohier .
Cluster Computing, 2012, 15 :335-350
[15]   A distributed clustering algorithm for large-scale dynamic networks [J].
Bernard, Thibault ;
Bui, Alain ;
Pilard, Laurence ;
Sohier, Devan .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2012, 15 (04) :335-350
[16]   Large-scale distributed computing for accelerated structure solution [J].
Shankland, K. ;
Griffin, T. A. N. ;
van de Streek, J. ;
Cole, J. C. ;
Shankland, N. ;
Florence, A. J. ;
David, W. I. F. .
ZEITSCHRIFT FUR KRISTALLOGRAPHIE, 2009, :227-232
[17]   Running large-scale applications on cluster grids [J].
Frattolillo, F .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2005, 19 (02) :157-172
[18]   A Distributed Low-Complexity Coding Solution for Large-Scale Distributed FFT [J].
Yazdanialahabadi, Arash ;
Ardakani, Masoud .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (11) :6617-6628
[19]   A Large-Scale Distributed Sorting Algorithm Based on Cloud Computing [J].
Pang, Na ;
Zhu, Dali ;
Fan, Zheming ;
Rong, Wenjing ;
Feng, Weimiao .
APPLICATIONS AND TECHNIQUES IN INFORMATION SECURITY, ATIS 2015, 2015, 557 :226-237
[20]   Resource Allocation for Energy Efficient Large-Scale Distributed Systems [J].
Lee, Young Choon ;
Zomaya, Albert Y. .
INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT, PROCEEDINGS, 2010, 54 :16-19