AI Meets Exascale Computing: Advancing Cancer Research With Large-Scale High Performance Computing

被引:20
作者
Bhattacharya, Tanmoy [1 ]
Brettin, Thomas [2 ]
Doroshow, James H. [3 ]
Evrard, Yvonne A. [4 ]
Greenspan, Emily J. [5 ]
Gryshuk, Amy L. [6 ]
Hoang, Thuc T. [7 ]
Lauzon, Carolyn B. Vea [8 ]
Nissley, Dwight [9 ]
Penberthy, Lynne [10 ]
Stahlberg, Eric [11 ]
Stevens, Rick [2 ,12 ]
Streitz, Fred [13 ]
Tourassi, Georgia [14 ]
Xia, Fangfang [15 ]
Zaki, George [11 ]
机构
[1] Los Alamos Natl Lab, Theoret Div, Los Alamos, NM USA
[2] Argonne Natl Lab, Comp Environm & Life Sci Directorate, Lemont, IL USA
[3] NCI, Div Canc Treatment & Diag, Bethesda, MD 20892 USA
[4] Frederick Natl Lab Canc Res, Appl Dev & Res Directorate, Frederick, MD USA
[5] NCI, Ctr Biomed Informat & Informat Technol, Bethesda, MD 20892 USA
[6] Lawrence Livermore Natl Lab, Phys & Life Sci Directorate, Livermore, CA 94550 USA
[7] US DOE, Natl Nucl Secur Adm, Adv Simulat & Comp, Washington, DC 20585 USA
[8] US DOE, Off Sci, Adv Sci Comp Res, Washington, DC 20585 USA
[9] Frederick Natl Lab Canc Res, NCI RAS Initiat, Canc Res Technol Program, Frederick, MD USA
[10] NCI, Div Canc Control & Populat Sci, Bethesda, MD 20892 USA
[11] Frederick Natl Lab Canc Res, Biomed Informat & Data Sci Directorate, Frederick, MD 21701 USA
[12] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[13] Lawrence Livermore Natl Lab, High Performance Comp Innovat Ctr, Livermore, CA 94550 USA
[14] Oak Ridge Natl Lab, Hlth Data Sci Inst, Oak Ridge, TN USA
[15] Argonne Natl Lab, Data Sci & Learning Div, Lemont, IL USA
来源
FRONTIERS IN ONCOLOGY | 2019年 / 9卷
基金
美国国家卫生研究院;
关键词
cancer research; high performance computing; artificial intelligence; deep learning; natural language processing; multi-scale modeling; precision medicine; uncertainty quantification; RESOURCE; DISCOVERY;
D O I
10.3389/fonc.2019.00984
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
The application of data science in cancer research has been boosted by major advances in three primary areas: (1) Data: diversity, amount, and availability of biomedical data; (2) Advances in Artificial Intelligence (AI) and Machine Learning (ML) algorithms that enable learning from complex, large-scale data; and (3) Advances in computer architectures allowing unprecedented acceleration of simulation and machine learning algorithms. These advances help build in silico ML models that can provide transformative insights from data including: molecular dynamics simulations, next-generation sequencing, omics, imaging, and unstructured clinical text documents. Unique challenges persist, however, in building ML models related to cancer, including: (1) access, sharing, labeling, and integration of multimodal and multi-institutional data across different cancer types; (2) developing AI models for cancer research capable of scaling on next generation high performance computers; and (3) assessing robustness and reliability in the AI models. In this paper, we review the National Cancer Institute (NCI) -Department of Energy (DOE) collaboration, Joint Design of Advanced Computing Solutions for Cancer (JDACS4C), a multi-institution collaborative effort focused on advancing computing and data technologies to accelerate cancer research on three levels: molecular, cellular, and population. This collaboration integrates various types of generated data, pre-exascale compute resources, and advances in ML models to increase understanding of basic cancer biology, identify promising new treatment options, predict outcomes, and eventually prescribe specialized treatments for patients with cancer.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Transforming medical sciences with high-performance computing, high-performance data analytics and AI
    Lewandowski, Natalie
    Koller, Bastian
    TECHNOLOGY AND HEALTH CARE, 2023, 31 (04) : 1505 - 1507
  • [32] Advanced Computing and Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers
    Fujisawa, Katsuki
    Endo, Toshio
    Yasui, Yuichiro
    MATHEMATICAL SOFTWARE, ICMS 2016, 2016, 9725 : 265 - 274
  • [33] Advanced Computing and Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers
    Fujisawa, Katsuki
    Suzumura, Toyotaro
    Sato, Hitoshi
    Ueno, Koji
    Yasui, Yuichiro
    Iwabuchi, Keita
    Endo, Toshio
    OPTIMIZATION IN THE REAL WORLD: TOWARD SOLVING REAL-WORLD OPTIMIZATION PROBLEMS, 2016, 13 : 1 - 13
  • [34] Distributed computing as a virtual supercomputer: Tools to run and manage large-scale BOINC simulations
    Giorgino, Toni
    Harvey, M. J.
    de Fabritiis, Gianni
    COMPUTER PHYSICS COMMUNICATIONS, 2010, 181 (08) : 1402 - 1409
  • [35] Deep and reinforcement learning for automated task scheduling in large-scale cloud computing systems
    Rjoub, Gaith
    Bentahar, Jamal
    Wahab, Omar Abdel
    Bataineh, Ahmed Saleh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (23)
  • [36] Massive-Scale Gaze Analytics Exploiting High Performance Computing
    Duchowski, Andrew T.
    Bolte, Takumi
    Krejtz, Krzysztof
    INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 : 137 - 147
  • [37] A Perspective on Scalable AI on High-Performance Computing and Leadership Class Supercomputing Facilities
    Pasini, Massimiliano Lupo
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (03) : 6 - 8
  • [38] High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers
    Qin, Xinming
    Chen, Junshi
    Luo, Zhaolong
    Wan, Lingyun
    Li, Jielan
    Jiao, Shizhe
    Zhang, Zhenlin
    Jiang, Qingcai
    Hu, Wei
    An, Hong
    Yang, Jinlong
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2023, 5 (01) : 26 - 42
  • [39] High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers
    Xinming Qin
    Junshi Chen
    Zhaolong Luo
    Lingyun Wan
    Jielan Li
    Shizhe Jiao
    Zhenlin Zhang
    Qingcai Jiang
    Wei Hu
    Hong An
    Jinlong Yang
    CCF Transactions on High Performance Computing, 2023, 5 : 26 - 42
  • [40] BIGS: A Framework for Large-Scale Image Processing and Analysis Over Distributed and Heterogeneous Computing Resources
    Ramos-Pollan, Raul
    Gonzalez, Fabio A.
    Caicedo, Juan C.
    Cruz-Roa, Angel
    Camargo, Jorge E.
    Vanegas, Jorge A.
    Perez, Santiago A.
    David Bermeo, Jose
    Sebastian Otalora, Juan
    Rozo, Paola K.
    Arevalo, John E.
    2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2012,