Optimal decision-making in high-throughput virtual screening pipelines

Cited by: 2
Authors
Woo, Hyun-Myung [1]
Qian, Xiaoning [2,3]
Tan, Li [3]
Jha, Shantenu [3,4]
Alexander, Francis J. [5]
Dougherty, Edward R. [2]
Yoon, Byung-Jun [2,3]
Affiliations
[1] Incheon Natl Univ, Dept Biomed & Robot Engn, Incheon 22012, South Korea
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[3] Brookhaven Natl Lab, Computat Sci Initiat, Upton, NY 11973 USA
[4] Rutgers State Univ, Dept Elect & Comp Engn, Piscataway, NJ 08854 USA
[5] Argonne Natl Lab, Comp Environm & Life Sci, Lemont, IL 60439 USA
Source
PATTERNS | 2023 / Volume 4 / Issue 11
Funding
National Research Foundation, Singapore; U.S. National Science Foundation;
Keywords
LITHIUM-ION BATTERIES; LONG NONCODING RNAS; REDOX PROPERTIES; ENERGY-STORAGE; DESIGN; DERIVATIVES; LNCRNA; THERMODYNAMICS; DISCOVERY; LI;
DOI
10.1016/j.patter.2023.100875
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
The need for efficient computational screening of molecular candidates that possess desired properties frequently arises in various scientific and engineering problems, including drug discovery and materials design. However, the enormous search space containing the candidates and the substantial computational cost of high-fidelity property prediction models make screening practically challenging. In this work, we propose a general framework for constructing and optimizing a high-throughput virtual screening (HTVS) pipeline that consists of multi-fidelity models. The central idea is to optimally allocate the computational resources to models with varying costs and accuracy to optimize the return on computational investment. Based on both simulated and real-world data, we demonstrate that the proposed optimal HTVS framework can significantly accelerate virtual screening without any degradation in terms of accuracy. Furthermore, it enables an adaptive operational strategy for HTVS, where one can trade accuracy for efficiency.
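The central idea of the abstract, spending compute on cheap, less accurate models before costly, accurate ones, can be illustrated with a toy two-stage screen. The following is a minimal sketch under stated assumptions and is not the authors' framework: the Gaussian candidate model, the function names (`simulate_candidates`, `run_pipeline`, `cheapest_threshold`), the cost values, and the grid search over a single threshold are all hypothetical choices made for illustration.

```python
import random


def simulate_candidates(n, noise_sd, seed=0):
    """Return (true_value, cheap_score) pairs for n hypothetical candidates.

    Assumption: true property values are standard normal, and the cheap
    model observes them with additive Gaussian noise of spread noise_sd.
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        true_value = rng.gauss(0.0, 1.0)
        cheap_score = true_value + rng.gauss(0.0, noise_sd)
        pairs.append((true_value, cheap_score))
    return pairs


def run_pipeline(pairs, threshold, target, cheap_cost, costly_cost):
    """Two-stage screen: cheap model on everything, costly model on survivors.

    Returns (total_cost, recall), where recall is the fraction of candidates
    with true_value >= target that survive the cheap stage.
    """
    survivors = [(t, s) for t, s in pairs if s >= threshold]
    hits = sum(1 for t, _ in survivors if t >= target)
    total_hits = sum(1 for t, _ in pairs if t >= target)
    cost = cheap_cost * len(pairs) + costly_cost * len(survivors)
    recall = hits / total_hits if total_hits else 1.0
    return cost, recall


def cheapest_threshold(pairs, thresholds, target, min_recall,
                       cheap_cost=1.0, costly_cost=100.0):
    """Return (threshold, cost, recall) with the lowest cost among the
    thresholds whose recall is at least min_recall, or None if none qualify."""
    best = None
    for th in thresholds:
        cost, recall = run_pipeline(pairs, th, target, cheap_cost, costly_cost)
        if recall >= min_recall and (best is None or cost < best[1]):
            best = (th, cost, recall)
    return best


if __name__ == "__main__":
    pairs = simulate_candidates(n=10000, noise_sd=0.5)
    grid = [x / 10 for x in range(-30, 31)]
    # Lowering the required recall lets the screen pass fewer candidates
    # to the costly model, which is the accuracy-for-efficiency trade-off.
    for min_recall in (1.0, 0.95, 0.8):
        print(min_recall, cheapest_threshold(pairs, grid, target=1.5,
                                             min_recall=min_recall))
```

In this toy setting, relaxing `min_recall` can only keep the total cost the same or reduce it, because every threshold that satisfies the stricter requirement also satisfies the looser one. That mirrors, in a very simplified form, the adaptive operation described in the abstract.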
Pages: 16