Opportunities for retrieval and tool augmented large language models in scientific facilities

Cited by: 6
Authors
Prince, Michael H. [1]
Chan, Henry [2]
Vriza, Aikaterini [2]
Zhou, Tao [2]
Sastry, Varuni K. [3]
Luo, Yanqi [1]
Dearing, Matthew T. [4]
Harder, Ross J. [1]
Vasudevan, Rama K. [5]
Cherukara, Mathew J. [1]
Affiliations
[1] Argonne Natl Lab, Adv Photon Source, Lemont, IL 60439 USA
[2] Argonne Natl Lab, Ctr Nanoscale Mat, Lemont, IL USA
[3] Argonne Natl Lab, Argonne Leadership Comp Facil, Lemont, IL USA
[4] Argonne Natl Lab, Business & Informat Syst, Lemont, IL USA
[5] Oak Ridge Natl Lab, Ctr Nanophase Mat, Oak Ridge, TN USA
Keywords
Photons - Problem oriented languages;
DOI
10.1038/s41524-024-01423-2
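Chinese Library Classification (CLC) number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification code
070304; 081704;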
Abstract
Upgrades to advanced scientific user facilities such as next-generation x-ray light sources, nanoscience centers, and neutron facilities are revolutionizing our understanding of materials across the spectrum of the physical sciences, from life sciences to microelectronics. However, these facility and instrument upgrades come with a significant increase in complexity. Driven by more exacting scientific needs, instruments and experiments become more intricate each year. This increased operational complexity makes it ever more challenging for domain scientists to design experiments that effectively leverage the capabilities of and operate on these advanced instruments. Large language models (LLMs) can perform complex information retrieval, assist in knowledge-intensive tasks across applications, and provide guidance on tool usage. Using x-ray light sources, leadership computing, and nanoscience centers as representative examples, we describe preliminary experiments with a Context-Aware Language Model for Science (CALMS) to assist scientists with instrument operations and complex experimentation. With the ability to retrieve relevant information from facility documentation, CALMS can answer simple questions on scientific capabilities and other operational procedures. With the ability to interface with software tools and experimental hardware, CALMS can conversationally operate scientific instruments. By making information more accessible and acting on user needs, LLMs could expand and diversify scientific facilities' users and accelerate scientific output.
Pages: 8
Related papers
40 in total
[11]  
Hoffmann J, 2022, ADV NEUR IN
[12]  
Huang JX, 2023, 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, P1051
[13]  
HuggingFace, 2023, About us
[14]  
HuggingFace, 2023, Can foundation models label data like humans?
[15]   14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon [J].
Jablonka, Kevin Maik ;
Ai, Qianxiang ;
Al-Feghali, Alexander ;
Badhwar, Shruti ;
Bocarsly, Joshua D. ;
Bran, Andres M. ;
Bringuier, Stefan ;
Brinson, L. Catherine ;
Choudhary, Kamal ;
Circi, Defne ;
Cox, Sam ;
de Jong, Wibe A. ;
Evans, Matthew L. ;
Gastellu, Nicolas ;
Genzling, Jerome ;
Gil, Maria Victoria ;
Gupta, Ankur K. ;
Hong, Zhi ;
Imran, Alishba ;
Kruschwitz, Sabine ;
Labarre, Anne ;
Lala, Jakub ;
Liu, Tao ;
Ma, Steven ;
Majumdar, Sauradeep ;
Merz, Garrett W. ;
Moitessier, Nicolas ;
Moubarak, Elias ;
Mourino, Beatriz ;
Pelkie, Brenden ;
Pieler, Michael ;
Ramos, Mayk Caldas ;
Rankovic, Bojana ;
Rodriques, Samuel G. ;
Sanders, Jacob N. ;
Schwaller, Philippe ;
Schwarting, Marcus ;
Shi, Jiale ;
Smit, Berend ;
Smith, Ben E. ;
Van Herck, Joren ;
Voelker, Christoph ;
Ward, Logan ;
Warren, Sean ;
Weiser, Benjamin ;
Zhang, Sylvester ;
Zhang, Xiaoqi ;
Zia, Ghezal Ahmad ;
Scourtas, Aristana ;
Schmidt, K. J. .
DIGITAL DISCOVERY, 2023, 2 (05) :1233-1250
[16]  
Jablonka KM, 2023, chemRxiv, DOI 10.26434/chemrxiv-2023-fw8n4
[17]   Commentary: The Materials Project: A materials genome approach to accelerating materials innovation [J].
Jain, Anubhav ;
Ong, Shyue Ping ;
Hautier, Geoffroy ;
Chen, Wei ;
Richards, William Davidson ;
Dacek, Stephen ;
Cholia, Shreyas ;
Gunter, Dan ;
Skinner, David ;
Ceder, Gerbrand ;
Persson, Kristin A. .
APL MATERIALS, 2013, 1 (01)
[18]   ChatGPT for good? On opportunities and challenges of large language models for education [J].
Kasneci, Enkelejda ;
Sessler, Kathrin ;
Kuechemann, Stefan ;
Bannert, Maria ;
Dementieva, Daryna ;
Fischer, Frank ;
Gasser, Urs ;
Groh, Georg ;
Guennemann, Stephan ;
Huellermeier, Eyke ;
Krusche, Stephan ;
Kutyniok, Gitta ;
Michaeli, Tilman ;
Nerdel, Claudia ;
Pfeffer, Juergen ;
Poquet, Oleksandra ;
Sailer, Michael ;
Schmidt, Albrecht ;
Seidel, Tina ;
Stadler, Matthias ;
Weller, Jochen ;
Kuhn, Jochen ;
Kasneci, Gjergji .
LEARNING AND INDIVIDUAL DIFFERENCES, 2023, 103
[19]  
Kojima T, 2022, ADV NEUR IN
[20]  
Lewis P, 2020, ADV NEUR IN, V33