Compositional design of multicomponent alloys using reinforcement learning

被引:6
|
作者
Xian, Yuehui [1 ]
Dang, Pengfei [1 ]
Tian, Yuan [1 ]
Jiang, Xue [2 ]
Zhou, Yumei [1 ]
Ding, Xiangdong [1 ]
Sun, Jun [1 ]
Lookman, Turab [1 ,2 ,3 ]
Xue, Dezhen [1 ]
机构
[1] Xi An Jiao Tong Univ, State Key Lab Mech Behav Mat, Xian 710049, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing 100083, Peoples R China
[3] AiMat Res LLC, Santa Fe, NM 87501 USA
基金
中国国家自然科学基金;
关键词
Compositional design; Reinforcement learning; Multicomponent alloys; Transformational enthalpy; Phase change materials; PHASE-CHANGE MATERIALS; HIGH ENTROPY ALLOYS; TEMPERATURES; STORAGE;
D O I
10.1016/j.actamat.2024.120017
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The design of alloys has typically involved adaptive experimental synthesis and characterization guided by machine learning models fitted to available data. A bottleneck for sequential design, be it for self-driven or manual synthesis, by Bayesian Global Optimization (BGO) for example, is that the search space becomes intractable as the number of alloy elements and its compositions exceed a threshold. Here we investigate how reinforcement learning (RL) performs in the compositional design of alloys within a closed loop with manual synthesis and characterization. We demonstrate this strategy by designing a phase change multicomponent alloy (Ti 27.2 Ni 47 Hf 13.8 Zr 12 ) with the highest transformation enthalpy (Delta H) Delta H)-37.1 J/g (-39.0 J/g with further calibration) within the TiNi-based family of alloys from a space of over 2 x 108 8 candidates, although the initial training is only on a compact dataset of 112 alloys. We show how the training efficiency is increased by employing acquisition functions containing uncertainties, such as expected improvement (EI), as the reward itself. Existing alloy data is often limited, however, if the agent is pretrained on experimental results prior to the training process, it can access regions of higher reward values more frequently. In addition, the experimental feedback enables the agent to gradually explore new regions with higher rewards, compositionally different from the initial dataset. Our approach directly applies to processing conditions where the actions would be performed in a given order. We also compare RL performance to BGO and the genetic algorithm on several test functions to gain insight on their relative strengths in materials design.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Recursive Compositional Reinforcement Learning for Continuous Control
    Tanik, Guven Orkun
    Ertekin, Seyda
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [2] Multicomponent alloys design and mechanical response: From high entropy alloys to complex concentrated alloys
    Cabrera, Manuel
    Oropesa, Yovany
    Sanhueza, Juan Pablo
    Tuninetti, Victor
    Onate, Angelo
    MATERIALS SCIENCE & ENGINEERING R-REPORTS, 2024, 161
  • [3] Device-System End-to-End Design of Photonic Neuromorphic Processor Using Reinforcement Learning
    Tang, Yingheng
    Zamani, Princess Tara
    Chen, Ruiyang
    Ma, Jianzhu
    Qi, Minghao
    Yu, Cunxi
    Gao, Weilu
    LASER & PHOTONICS REVIEWS, 2023, 17 (02)
  • [4] Accelerated design of multicomponent metallic glasses using machine learning
    Bajpai, Anurag
    Bhatt, Jatin
    Gurao, N. P.
    Biswas, Krishanu
    JOURNAL OF MATERIALS RESEARCH, 2022, 37 (15) : 2428 - 2445
  • [5] A study on automatic fixture design using reinforcement learning
    Darren Wei Wen Low
    Dennis Wee Keong Neo
    A. Senthil Kumar
    The International Journal of Advanced Manufacturing Technology, 2020, 107 : 2303 - 2311
  • [6] A study on automatic fixture design using reinforcement learning
    Low, Darren Wei Wen
    Neo, Dennis Wee Keong
    Kumar, A. Senthil
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2020, 107 (5-6) : 2303 - 2311
  • [7] Integrating process design and control using reinforcement learning
    Sachio, Steven
    Mowbray, Max
    Papathanasiou, Maria M.
    Rio-Chanona, Ehecatl Antonio del
    Petsagkourakis, Panagiotis
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 183 : 160 - 169
  • [8] Automated Design of Analog Circuits Using Reinforcement Learning
    Settaluri, Keertana
    Liu, Zhaokai
    Khurana, Rishubh
    Mirhaj, Arash
    Jain, Rajeev
    Nikolic, Borivoje
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (09) : 2794 - 2807
  • [9] Compositional design of compounds with elements not in training data using supervised learning
    He, Jingjin
    Yin, Ruowei
    Wang, Changxin
    Liu, Chuanbao
    Xue, Dezhen
    Su, Yanjing
    Qiao, Lijie
    Lookman, Turab
    Bai, Yang
    JOURNAL OF MATERIOMICS, 2025, 11 (03)
  • [10] Investigations into the Design and Implementation of Reinforcement Learning Using Deep Learning Neural Networks
    Tudoroiu, Roxana-Elena
    Zaheeruddin, Mohammed
    Curiac, Daniel-Ioan
    Radu, Mihai Sorin
    Tudoroiu, Nicolae
    ALGORITHMS, 2025, 18 (03)