Compositional design of multicomponent alloys using reinforcement learning

被引:6
|
作者
Xian, Yuehui [1 ]
Dang, Pengfei [1 ]
Tian, Yuan [1 ]
Jiang, Xue [2 ]
Zhou, Yumei [1 ]
Ding, Xiangdong [1 ]
Sun, Jun [1 ]
Lookman, Turab [1 ,2 ,3 ]
Xue, Dezhen [1 ]
机构
[1] Xi An Jiao Tong Univ, State Key Lab Mech Behav Mat, Xian 710049, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing 100083, Peoples R China
[3] AiMat Res LLC, Santa Fe, NM 87501 USA
基金
中国国家自然科学基金;
关键词
Compositional design; Reinforcement learning; Multicomponent alloys; Transformational enthalpy; Phase change materials; PHASE-CHANGE MATERIALS; HIGH ENTROPY ALLOYS; TEMPERATURES; STORAGE;
D O I
10.1016/j.actamat.2024.120017
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The design of alloys has typically involved adaptive experimental synthesis and characterization guided by machine learning models fitted to available data. A bottleneck for sequential design, be it for self-driven or manual synthesis, by Bayesian Global Optimization (BGO) for example, is that the search space becomes intractable as the number of alloy elements and its compositions exceed a threshold. Here we investigate how reinforcement learning (RL) performs in the compositional design of alloys within a closed loop with manual synthesis and characterization. We demonstrate this strategy by designing a phase change multicomponent alloy (Ti 27.2 Ni 47 Hf 13.8 Zr 12 ) with the highest transformation enthalpy (Delta H) Delta H)-37.1 J/g (-39.0 J/g with further calibration) within the TiNi-based family of alloys from a space of over 2 x 108 8 candidates, although the initial training is only on a compact dataset of 112 alloys. We show how the training efficiency is increased by employing acquisition functions containing uncertainties, such as expected improvement (EI), as the reward itself. Existing alloy data is often limited, however, if the agent is pretrained on experimental results prior to the training process, it can access regions of higher reward values more frequently. In addition, the experimental feedback enables the agent to gradually explore new regions with higher rewards, compositionally different from the initial dataset. Our approach directly applies to processing conditions where the actions would be performed in a given order. We also compare RL performance to BGO and the genetic algorithm on several test functions to gain insight on their relative strengths in materials design.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Empirical Design in Reinforcement Learning
    Patterson, Andrew
    Neumann, Samuel
    White, Martha
    White, Adam
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 63
  • [22] Novel Multicomponent B2-Ordered Aluminides: Compositional Design, Synthesis, Characterization, and Thermal Stability
    Muralikrishna, G. Mohan
    Esther, A. Carmel Mary
    Guruvidyathri, K.
    Watermeyer, Philipp
    Liebscher, Christian H.
    Kulkarni, Kaustubh N.
    Wilde, Gerhard
    Divinski, Sergiy V.
    Murty, B. S.
    METALS, 2020, 10 (11) : 1 - 19
  • [23] Introducing reinforcement learning to the energy system design process
    Perera, A. T. D.
    Wickramasinghe, P. U.
    Nik, Vahid M.
    Scartezzini, Jean-Louis
    APPLIED ENERGY, 2020, 262
  • [24] Multicomponent and High Entropy Alloys
    Cantor, Brian
    ENTROPY, 2014, 16 (09) : 4749 - 4768
  • [25] Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems
    Lavaei, Abolfazl
    Perez, Mateo
    Kazemi, Milad
    Somenzi, Fabio
    Soudjani, Sadegh
    Trivedi, Ashutosh
    Zamani, Majid
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 425 - 438
  • [26] A study of controller design using an on-line evolutionary Reinforcement Learning
    Kondo, T
    Ito, K
    8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, : 665 - 670
  • [27] Cost Optimization at Early Stages of Design Using Deep Reinforcement Learning
    Servadei, Lorenzo
    Zheng, Jiapeng
    Arjona-Medina, Jose
    Werner, Michael
    Esen, Volkan
    Hochreiter, Sepp
    Ecker, Wolfgang
    Wille, Robert
    PROCEEDINGS OF THE 2020 ACM/IEEE 2ND WORKSHOP ON MACHINE LEARNING FOR CAD (MLCAD '20), 2020, : 37 - 42
  • [28] Photonic architecture for reinforcement learning
    Flamini, Fulvio
    Hamann, Arne
    Jerbi, Sofiene
    Trenkwalder, Lea M.
    Nautrup, Hendrik Poulsen
    Briegel, Hans J.
    NEW JOURNAL OF PHYSICS, 2020, 22 (04)
  • [29] MoNbV, MoNbVTi and MoNbVTiHf multicomponent refractory alloys-Compositional modulated mechanical properties investigating
    Meng, Gang
    Gao, Rongli
    Liu, Fenghua
    Yu, Jianxin
    MATERIALS TODAY COMMUNICATIONS, 2022, 33
  • [30] Automated design and optimization of distributed filter circuits using reinforcement learning
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (05) : 60 - 76