Steel design based on a large language model

被引:2
|
作者
Tian, Shaohan [1 ]
Jiang, Xue [1 ,2 ]
Wang, Weiren [1 ]
Jing, Zhihua [1 ]
Zhang, Chi [1 ]
Zhang, Cheng [1 ]
Lookman, Turab [3 ]
Su, Yanjing [1 ]
机构
[1] Univ Sci & Technol Beijing, Inst Adv Mat & Technol, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing 100083, Peoples R China
[2] Liaoning Acad Mat, Shenyang 110000, Liaoning, Peoples R China
[3] AiMaterials Res LLC, Santa Fe, NM 87501 USA
基金
中国国家自然科学基金;
关键词
Property prediction; Steel design; Materials language model; Deep learning; Artificial intelligence; MACHINE; STRENGTH;
D O I
10.1016/j.actamat.2024.120663
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The success of artificial intelligence (AI) in materials research heavily relies on the integrity of structured data and the construction of precise descriptors. In this study, we present an end-to-end pipeline from materials text to properties for steels based on a large language model. The objective is to enable quantitative predictions of properties with high-accuracy and explore new steels. The pipeline includes a materials language encoder, named SteelBERT, and a multimodal deep learning framework that maps the composition and text sequence of complex fabrication processes to mechanical properties. We demonstrate high accuracy on mechanical properties, including yield strength (YS), ultimate tensile strength (UTS), and elongation (EL) by predicting determination coefficients (R2) reaching 78.17 % ( f 3.40 %), 82.56 % ( f 1.96 %), and 81.44 % ( f 2.98 %) respectively. Further, through an additional fine-tuning strategy for the design of specific steels with small datasets, we show how the performance can be refined. With only 64 experimental samples of 15Cr austenitic stainless steels, we obtain an optimized model with R2 of 89.85 % ( f 6.17 %), 88.34 % ( f 5.95 %) and 87.24 % ( f 5.15 %) for YS, UTS and EL, that requires the user to input composition and text sequence for processing and which outputs mechanical properties. The model efficiently optimizes the text sequence for the fabrication process by suggesting a secondary round of cold rolling and tempering to yield an exceptional YS of 960 MPa, UTS of 1138 MPa, and EL of 32.5 %, exceeding those of reported 15Cr austenitic stainless steels.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] LLMPC: Large Language Model Predictive Control
    Maher, Gabriel
    COMPUTERS, 2025, 14 (03)
  • [22] Generating Simulated Data with a Large Language Model
    Kerley, Jeffrey
    Anderson, Derek T.
    Buck, Andrew R.
    Alvey, Brendan
    SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [23] Design pattern recognition: a study of large language models
    Pandey, Sushant Kumar
    Chand, Sivajeet
    Horkoff, Jennifer
    Staron, Miroslaw
    Ochodek, Miroslaw
    Durisic, Darko
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [24] A Method for Synthesizing Ontology-Based Textual Design Datasets: Evaluating the Potential of Large Language Model in Domain-Specific Dataset Generation
    Qiu, Yunjian
    Jin, Yan
    JOURNAL OF MECHANICAL DESIGN, 2025, 147 (04)
  • [25] FramedTruth: A Frame-Based Model Utilising Large Language Models for Misinformation Detection
    Wang, Guan
    Frederick, Rebecca
    Haghighi, Boshra Talebi
    Wong, B. L. William
    Rupar, Verica
    Li, Weihua
    Bai, Quan
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, ACIIDS 2024, 2024, 14795 : 135 - 146
  • [26] An improved transformer-based model for detecting phishing, spam and ham emails: A large language model approach
    Jamal, Suhaima
    Wimmer, Hayden
    Sarker, Iqbal H.
    SECURITY AND PRIVACY, 2024, 7 (05)
  • [27] Model-Based Residual Stress Design in Multiphase Seamless Steel Tubes
    Leitner, Silvia
    Winter, Gerald
    Klarner, Juergen
    Antretter, Thomas
    Ecker, Werner
    MATERIALS, 2020, 13 (02)
  • [28] Multimodal Large Language Model-Based Fault Detection and Diagnosis in Context of Industry 4.0
    Alsaif, Khalid M.
    Albeshri, Aiiad A.
    Khemakhem, Maher A.
    Eassa, Fathy E.
    ELECTRONICS, 2024, 13 (24):
  • [29] Large language model-supported interactive case-based learning: a pilot study
    Gim, Haelynn
    Cook, Benjamin
    Le, Jasmin
    Stretton, Brandon
    Gao, Christina
    Gupta, Aashray
    Kovoor, Joshua
    Guo, Christina
    Arnold, Matthew
    Gheihman, Galina
    Bacchi, Stephen
    INTERNAL MEDICINE JOURNAL, 2025,
  • [30] GalaxyGPT: A Hybrid Framework for Large Language Model Safety
    Zhou, Hange
    Zheng, Jiabin
    Zhang, Longtu
    IEEE ACCESS, 2024, 12 : 94436 - 94451