Employing artificial intelligence to steer exascale workflows with colmena

被引:1
|
作者
Ward, Logan [1 ]
Pauloski, J. Gregory [2 ]
Hayot-Sasson, Valerie [2 ]
Babuji, Yadu [2 ]
Brace, Alexander [2 ]
Chard, Ryan [1 ]
Chard, Kyle [2 ]
Thakur, Rajeev [1 ]
Foster, Ian [1 ]
机构
[1] Argonne Natl Lab, Data Sci & Learning Div, 9700 S Cass Ave, Argonne, IL 60439 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL USA
来源
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2025年 / 39卷 / 01期
关键词
Workflows; artificial intelligence; computational steering;
D O I
10.1177/10943420241288242
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Computational workflows are a common class of application on supercomputers, yet the loosely coupled and heterogeneous nature of workflows often fails to take full advantage of their capabilities. We created Colmena to leverage the massive parallelism of a supercomputer by using Artificial Intelligence (AI) to learn from and adapt a workflow as it executes. Colmena allows scientists to define how their application should respond to events (e.g., task completion) as a series of cooperative agents. In this paper, we describe the design of Colmena, the challenges we overcame while deploying applications on exascale systems, and the science workflows we have enhanced through interweaving AI. The scaling challenges we discuss include developing steering strategies that maximize node utilization, introducing data fabrics that reduce communication overhead of data-intensive tasks, and implementing workflow tasks that cache costly operations between invocations. These innovations coupled with a variety of application patterns accessible through our agent-based steering model have enabled science advances in chemistry, biophysics, and materials science using different types of AI. Our vision is that Colmena will spur creative solutions that harness AI across many domains of scientific computing.
引用
收藏
页码:52 / 64
页数:13
相关论文
共 50 条
  • [1] ExaWorks: Workflows for Exascale
    Al-Saadi, Aymen
    Ahn, Dong H.
    Babuji, Yadu
    Chard, Kyle
    Corbett, James
    Hategan, Mihael
    Herbein, Stephen
    Jha, Shantenu
    Laney, Daniel
    Merzky, Andre
    Munson, Todd
    Salim, Michael
    Titov, Mikhail
    Turilli, Matteo
    Uram, Thomas D.
    Wozniak, Justin M.
    PROCEEDINGS OF 16TH WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS21), 2021, : 50 - 57
  • [2] The role of artificial intelligence in clinical imaging and workflows
    Wilson, Diane U.
    Bailey, Michael Q.
    Craig, John
    VETERINARY RADIOLOGY & ULTRASOUND, 2022, 63 : 897 - 902
  • [3] Artificial intelligence Tiny swimbots get a steer from a 'brain'
    Stokel-Walker, Chris
    NEW SCIENTIST, 2021, 245 (3328) : 20 - 20
  • [4] Flux: Overcoming scheduling challenges for exascale workflows
    Ahn, Dong H.
    Bass, Ned
    Chu, Albert
    Garlick, Jim
    Grondona, Mark
    Herbein, Stephen
    Ingolfsson, Helgi I.
    Koning, Joseph
    Patki, Tapasya
    Scogland, Thomas R. W.
    Springmeyer, Becky
    Taufer, Michela
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 110 : 202 - 213
  • [5] Flux: Overcoming Scheduling Challenges for Exascale Workflows
    Ahn, Dong H.
    Bass, Ned
    Chu, Albert
    Garlick, Jim
    Grondona, Mark
    Herbein, Stephen
    Koning, Joseph
    Patki, Tapasya
    Scogland, Thomas R. W.
    Springmeyer, Becky
    Taufer, Michela
    PROCEEDINGS OF WORKS 2018: 13TH IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS), 2018, : 10 - 19
  • [6] Application of artificial intelligence centric workflows for evaluation of neuroradiology emergencies
    Shakoor, Delaram
    Al-Dasuqi, Khalid
    Cavallo, Joe
    Ikuta, Ichiro
    Payabvash, Syedmehdi
    Malhotra, Ajay
    CLINICAL IMAGING, 2023, 101 : 133 - 136
  • [7] Advancing Lymphoma Diagnosis with Artificial Intelligence-Enhanced Workflows
    Yi, Hongmei
    Yan, Fang
    Da, Qian
    Wang, Chaofu
    LABORATORY INVESTIGATION, 2024, 104 (03) : S1511 - S1512
  • [8] From data to knowledge to discoveries: Artificial intelligence and scientific workflows
    Gil, Yolanda
    SCIENTIFIC PROGRAMMING, 2009, 17 (03) : 231 - 246
  • [9] Performance analysis and data reduction for exascale scientific workflows
    Kelly, Christopher
    Xu, Wei
    Pouchard, Line C.
    Van Dam, Hubertus
    Islam, Tanzima Z.
    Yoo, Shinjae
    Van Dam, Kerstin Kleese
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2025,
  • [10] Artificial intelligence in timber forensics employing DNA barcode database
    Dev, Suma Arun
    Unnikrishnan, Remya
    Prathibha, P. S.
    Sijimol, K.
    Sreekumar, V. B.
    AzharAli, A.
    Anoop, E. V.
    Viswanath, Syam
    3 BIOTECH, 2023, 13 (06)