Augmenting large language models with chemistry tools

被引:169
作者
Bran, Andres M. [1 ,2 ]
Cox, Sam [3 ,4 ]
Schilter, Oliver [1 ,2 ,5 ]
Baldassari, Carlo [5 ]
White, Andrew D. [3 ,4 ]
Schwaller, Philippe [1 ,2 ]
机构
[1] Ecole Polytech Fed Lausanne, Lab Artificial Chem Intelligence LIAC, ISIC, Lausanne, Switzerland
[2] Ecole Polytech Fed Lausanne, Natl Ctr Competence Res NCCR Catalysis, Lausanne, Switzerland
[3] Univ Rochester, Dept Chem Engn, Rochester, NY 14627 USA
[4] FutureHouse, San Francisco, CA 94107 USA
[5] IBM Res Europe, Accelerated Discovery, CH-8803 Ruschlikon, Switzerland
基金
瑞士国家科学基金会; 美国国家卫生研究院; 美国国家科学基金会;
关键词
TRANSFORMER; PREDICTION; DESIGN;
D O I
10.1038/s42256-024-00832-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) have shown strong performance in tasks across domains but struggle with chemistry-related problems. These models also lack access to external knowledge sources, limiting their usefulness in scientific applications. We introduce ChemCrow, an LLM chemistry agent designed to accomplish tasks across organic synthesis, drug discovery and materials design. By integrating 18 expert-designed tools and using GPT-4 as the LLM, ChemCrow augments the LLM performance in chemistry, and new capabilities emerge. Our agent autonomously planned and executed the syntheses of an insect repellent and three organocatalysts and guided the discovery of a novel chromophore. Our evaluation, including both LLM and expert assessments, demonstrates ChemCrow's effectiveness in automating a diverse set of chemical tasks. Our work not only aids expert chemists and lowers barriers for non-experts but also fosters scientific advancement by bridging the gap between experimental and computational chemistry. Large language models can be queried to perform chain-of-thought reasoning on text descriptions of data or computational tools, which can enable flexible and autonomous workflows. Bran et al. developed ChemCrow, a GPT-4-based agent that has access to computational chemistry tools and a robotic chemistry platform, which can autonomously solve tasks for designing or synthesizing chemicals such as drugs or materials.
引用
收藏
页码:525 / 535
页数:13
相关论文
共 103 条
[1]  
[Anonymous], 2023, Purchasable Mcule
[2]  
[Anonymous], 2023, Rdkit: Open-source cheminformatics
[3]  
[Anonymous], 2024, CHEM WEAPONS CONVENT
[4]  
Askell A, 2019, Arxiv, DOI [arXiv:1907.04534, DOI 10.48550/ARXIV.1907.04534]
[5]   REINVENT 2.0: An AI Tool for De Novo Drug Design [J].
Blaschke, Thomas ;
Arus-Pous, Josep ;
Chen, Hongming ;
Margreitter, Christian ;
Tyrchan, Christian ;
Engkvist, Ola ;
Papadopoulos, Kostas ;
Patronov, Atanas .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (12) :5918-5922
[6]   Autonomous chemical research with large language models [J].
Boiko, Daniil A. ;
Macknight, Robert ;
Kline, Ben ;
Gomes, Gabe .
NATURE, 2023, 624 (7992) :570-+
[7]  
Bommasani R., 2021, arXiv, DOI [10.48550/arXiv.2108.07258, DOI 10.48550/ARXIV.2108.07258]
[8]  
Bran Andres M, 2024, Zenodo, DOI 10.5281/ZENODO.10884639
[9]  
Bran Andres M, 2024, Zenodo, DOI 10.5281/ZENODO.10884645
[10]  
Brown TB, 2020, ADV NEUR IN, V33