MechGPT, a Language-Based Strategy for Mechanics and Materials Modeling That Connects Knowledge Across Scales, Disciplines, and Modalities

Cited by: 28
Authors
Buehler, Markus J. [1, 2]
Affiliations
[1] MIT, Lab Atomist & Mol Mech LAMM, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Schwarzman Coll Comp, Ctr Computat Sci & Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Funding
United States Department of Agriculture (USDA);
Keywords
mechanics; materials; failure; AI; scientific ML; attention; transformer; language model; GPT; human-machine;
DOI
10.1115/1.4063843
Chinese Library Classification
O3 [Mechanics];
Discipline classification code
08 ; 0801 ;
Abstract
For centuries, researchers have sought out ways to connect disparate areas of knowledge. While early scholars (Galileo, da Vinci, etc.) were experts across fields, specialization took hold later. With the advent of Artificial Intelligence, we can now explore relationships across areas (e.g., mechanics-biology) or disparate domains (e.g., failure mechanics-art). To achieve this, we use a fine-tuned large language model (LLM), here for a subset of knowledge in multiscale materials failure. The approach includes the use of a general-purpose LLM to distill question-answer pairs from raw sources followed by LLM fine-tuning. The resulting MechGPT LLM foundation model is used in a series of computational experiments to explore its capacity for knowledge retrieval, various language tasks, hypothesis generation, and connecting knowledge across disparate areas. While the model has some ability to recall knowledge from training, we find that LLMs are particularly useful for extracting structural insights through Ontological Knowledge Graphs. These interpretable graph structures provide explanatory insights, frameworks for new research questions, and visual representations of knowledge that also can be used in retrieval-augmented generation. Three versions of MechGPT are discussed, featuring different sizes from 13 × 10^9 to 70 × 10^9 parameters, and reaching context lengths of more than 10,000 tokens. This provides ample capacity for sophisticated retrieval-augmented strategies, as well as agent-based modeling where multiple LLMs interact collaboratively and/or adversarially, the incorporation of new data from the literature or web searches, as well as multimodality.
Pages: 35