Steel design based on a large language model

Cited by: 2

Authors:

Tian, Shaohan [1]

Jiang, Xue [1,2]

Wang, Weiren [1]

Jing, Zhihua [1]

Zhang, Chi [1]

Zhang, Cheng [1]

Lookman, Turab [3]

Su, Yanjing [1]

Affiliations:

[1] Univ Sci & Technol Beijing, Inst Adv Mat & Technol, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing 100083, Peoples R China

[2] Liaoning Acad Mat, Shenyang 110000, Liaoning, Peoples R China

[3] AiMaterials Res LLC, Santa Fe, NM 87501 USA

Source:

ACTA MATERIALIA | 2025 / Vol. 285

Funding:

National Natural Science Foundation of China;

Keywords:

Property prediction; Steel design; Materials language model; Deep learning; Artificial intelligence; MACHINE; STRENGTH;

DOI:

10.1016/j.actamat.2024.120663

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

The success of artificial intelligence (AI) in materials research relies heavily on the integrity of structured data and the construction of precise descriptors. In this study, we present an end-to-end pipeline from materials text to properties for steels based on a large language model. The objective is to enable quantitative, high-accuracy predictions of properties and to explore new steels. The pipeline includes a materials language encoder, named SteelBERT, and a multimodal deep learning framework that maps the composition and text sequence of complex fabrication processes to mechanical properties. We demonstrate high accuracy in predicting mechanical properties, including yield strength (YS), ultimate tensile strength (UTS), and elongation (EL), with coefficients of determination (R²) reaching 78.17 % (± 3.40 %), 82.56 % (± 1.96 %), and 81.44 % (± 2.98 %), respectively. Further, we show how the performance can be refined through an additional fine-tuning strategy for the design of specific steels with small datasets. With only 64 experimental samples of 15Cr austenitic stainless steels, we obtain an optimized model with R² of 89.85 % (± 6.17 %), 88.34 % (± 5.95 %), and 87.24 % (± 5.15 %) for YS, UTS, and EL, respectively. The model takes the composition and the processing text sequence as input and outputs mechanical properties. The model efficiently optimizes the text sequence for the fabrication process by suggesting a secondary round of cold rolling and tempering to yield an exceptional YS of 960 MPa, UTS of 1138 MPa, and EL of 32.5 %, exceeding those of reported 15Cr austenitic stainless steels.


Pages: 13
