AI Meets Exascale Computing: Advancing Cancer Research With Large-Scale High Performance Computing

被引:20
|
作者
Bhattacharya, Tanmoy [1 ]
Brettin, Thomas [2 ]
Doroshow, James H. [3 ]
Evrard, Yvonne A. [4 ]
Greenspan, Emily J. [5 ]
Gryshuk, Amy L. [6 ]
Hoang, Thuc T. [7 ]
Lauzon, Carolyn B. Vea [8 ]
Nissley, Dwight [9 ]
Penberthy, Lynne [10 ]
Stahlberg, Eric [11 ]
Stevens, Rick [2 ,12 ]
Streitz, Fred [13 ]
Tourassi, Georgia [14 ]
Xia, Fangfang [15 ]
Zaki, George [11 ]
机构
[1] Los Alamos Natl Lab, Theoret Div, Los Alamos, NM USA
[2] Argonne Natl Lab, Comp Environm & Life Sci Directorate, Lemont, IL USA
[3] NCI, Div Canc Treatment & Diag, Bethesda, MD 20892 USA
[4] Frederick Natl Lab Canc Res, Appl Dev & Res Directorate, Frederick, MD USA
[5] NCI, Ctr Biomed Informat & Informat Technol, Bethesda, MD 20892 USA
[6] Lawrence Livermore Natl Lab, Phys & Life Sci Directorate, Livermore, CA 94550 USA
[7] US DOE, Natl Nucl Secur Adm, Adv Simulat & Comp, Washington, DC 20585 USA
[8] US DOE, Off Sci, Adv Sci Comp Res, Washington, DC 20585 USA
[9] Frederick Natl Lab Canc Res, NCI RAS Initiat, Canc Res Technol Program, Frederick, MD USA
[10] NCI, Div Canc Control & Populat Sci, Bethesda, MD 20892 USA
[11] Frederick Natl Lab Canc Res, Biomed Informat & Data Sci Directorate, Frederick, MD 21701 USA
[12] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[13] Lawrence Livermore Natl Lab, High Performance Comp Innovat Ctr, Livermore, CA 94550 USA
[14] Oak Ridge Natl Lab, Hlth Data Sci Inst, Oak Ridge, TN USA
[15] Argonne Natl Lab, Data Sci & Learning Div, Lemont, IL USA
来源
FRONTIERS IN ONCOLOGY | 2019年 / 9卷
基金
美国国家卫生研究院;
关键词
cancer research; high performance computing; artificial intelligence; deep learning; natural language processing; multi-scale modeling; precision medicine; uncertainty quantification; RESOURCE; DISCOVERY;
D O I
10.3389/fonc.2019.00984
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
The application of data science in cancer research has been boosted by major advances in three primary areas: (1) Data: diversity, amount, and availability of biomedical data; (2) Advances in Artificial Intelligence (AI) and Machine Learning (ML) algorithms that enable learning from complex, large-scale data; and (3) Advances in computer architectures allowing unprecedented acceleration of simulation and machine learning algorithms. These advances help build in silico ML models that can provide transformative insights from data including: molecular dynamics simulations, next-generation sequencing, omics, imaging, and unstructured clinical text documents. Unique challenges persist, however, in building ML models related to cancer, including: (1) access, sharing, labeling, and integration of multimodal and multi-institutional data across different cancer types; (2) developing AI models for cancer research capable of scaling on next generation high performance computers; and (3) assessing robustness and reliability in the AI models. In this paper, we review the National Cancer Institute (NCI) -Department of Energy (DOE) collaboration, Joint Design of Advanced Computing Solutions for Cancer (JDACS4C), a multi-institution collaborative effort focused on advancing computing and data technologies to accelerate cancer research on three levels: molecular, cellular, and population. This collaboration integrates various types of generated data, pre-exascale compute resources, and advances in ML models to increase understanding of basic cancer biology, identify promising new treatment options, predict outcomes, and eventually prescribe specialized treatments for patients with cancer.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A Large-Scale Study of Failures in High-Performance Computing Systems
    Schroeder, Bianca
    Gibson, Garth A.
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2010, 7 (04) : 337 - 350
  • [2] An ASP model for large-scale genomics in a high-performance computing environment
    Cuticchia, J
    Zaifman, L
    Wallace, S
    Hulbert, G
    Silk, GW
    HIGH PERFORMANCE COMPUTING SYSTEMS AND APPLICATIONS, 2003, 727 : 3 - 3
  • [3] JUWELS Booster - A Supercomputer for Large-Scale AI Research
    Kesselheim, Stefan
    Herten, Andreas
    Krajsek, Kai
    Ebert, Jan
    Jitsev, Jenia
    Cherti, Mehdi
    Langguth, Michael
    Gong, Bing
    Stadtler, Scarlet
    Mozaffari, Amirpasha
    Cavallaro, Gabriele
    Sedona, Rocco
    Schug, Alexander
    Strube, Alexandre
    Kamath, Roshni
    Schultz, Martin G.
    Riedel, Morris
    Lippert, Thomas
    HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 453 - 468
  • [4] Advancing a distributed multi-scale computing framework for large-scale high-throughput discovery in materials science
    Knap, J.
    Spear, C. E.
    Borodin, O.
    Leiter, K. W.
    NANOTECHNOLOGY, 2015, 26 (43)
  • [5] High-performance computing framework with desynchronized information propagation for large-scale simulations
    Bujas, Jakub
    Dworak, Dawid
    Turek, Wojciech
    Byrski, Aleksander
    JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 32 : 70 - 86
  • [6] AdapCK: Optimizing I/O for Checkpointing on Large-Scale High Performance Computing Systems
    Jia, Jie
    Liu, Yi
    Liu, Yanke
    Chen, Yifan
    Lin, Fang
    EURO-PAR 2024: PARALLEL PROCESSING, PT III, EURO-PAR 2024, 2024, 14803 : 342 - 355
  • [7] Accelerating Large-Scale Molecular Similarity Search through Exploiting High Performance Computing
    Zhu, Chun Jiang
    Zhu, Tan
    Li, Haining
    Bi, Jinbo
    Song, Minghu
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 330 - 333
  • [8] HPC2lusterScape: Increasing Transparency and Efficiency of Shared High-Performance Computing Clusters for Large-scale AI Models
    Park, Heungseok
    Cho, Aeree
    Jeon, Hyojun
    Lee, Hayoung
    Yang, Youngil
    Lee, Sungjae
    Lee, Heungsub
    Choo, Jaegul
    2023 IEEE VISUALIZATION IN DATA SCIENCE, VDS, 2023, : 21 - 29
  • [9] Research Trends In High Performance Computing Application On Large Scale Power System Operation
    Amgai, Ranjit
    Shi, Jian
    Abdelwahed, Sherif
    Fu, Yong
    PROCEEDINGS OF THE 2012 GRAND CHALLENGES IN MODELING & SIMULATION (GCMS '12), 2012, 44 (11): : 112 - 119
  • [10] Predictive Dynamic Simulation for Large-Scale Power Systems through High-Performance Computing
    Huang, Zhenyu
    Jin, Shuangshuang
    Diao, Ruisheng
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 347 - 354