Representations of Materials for Machine Learning

被引:42
|
作者
Damewood, James [1 ]
Karaguesian, Jessica [1 ,2 ]
Lunger, Jaclyn R. [1 ]
Tan, Aik Rui [1 ]
Xie, Mingrou [1 ,3 ]
Peng, Jiayu [1 ]
Gomez-Bombarelli, Rafael [1 ]
机构
[1] MIT, Dept Mat Sci & Engn, Cambridge, MA USA
[2] MIT, Ctr Computat Sci & Engn, Cambridge, MA USA
[3] MIT, Dept Chem Engn, Cambridge, MA USA
关键词
representation; feature engineering; machine learning; materials science; crystal structure; generative model; INVERSE DESIGN; PREDICTION; REDUCTION; MOLECULES; NETWORKS; ENERGIES; MODELS;
D O I
10.1146/annurev-matsci-080921-085947
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning the relations between composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by an ML model. Data sets in materials science vary in format (ranging from images to spectra), size, and fidelity. Predictive models vary in scope and properties of interest. Here, we review context-dependent strategies for constructing representations that enable the use of materials as inputs or outputs for ML models. Furthermore, we discuss how modern ML techniques can learn representations from data and transfer chemical and physical information between tasks. Finally, we outline high-impact questions that have not been fully resolved and thus require further investigation.
引用
收藏
页码:399 / 426
页数:28
相关论文
共 50 条
  • [31] Representations of molecules and materials for interpolation of quantum-mechanical simulations via machine learning
    Langer, Marcel F.
    Goessmann, Alex
    Rupp, Matthias
    NPJ COMPUTATIONAL MATERIALS, 2022, 8 (01)
  • [32] High throughput screening of new piezoelectric materials using graph machine learning and knowledge graph approach
    Anand, Archit
    Kumari, Priyanka
    Kalyani, Ajay Kumar
    COMPUTATIONAL MATERIALS SCIENCE, 2025, 246
  • [33] Protein representations: Encoding biological information for machine learning in biocatalysis
    Harding-Larsen, David
    Funk, Jonathan
    Madsen, Niklas Gesmar
    Gharabli, Hani
    Acevedo-Rocha, Carlos G.
    Mazurenko, Stanislav
    Welner, Ditte Hededam
    BIOTECHNOLOGY ADVANCES, 2024, 77
  • [34] Crystal structure representations for machine learning models of formation energies
    Faber, Felix
    Lindmaa, Alexander
    von Lilienfeld, O. Anatole
    Armiento, Rickard
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2015, 115 (16) : 1094 - 1101
  • [35] Battery Materials Discovery and Smart Grid Management using Machine Learning
    Wong, Andrew Jun Yao
    Zhou, Xin
    Lum, Yanwei
    Yao, Zhenpeng
    Chua, Yang Choo
    Wen, Yonggang
    Seh, Zhi Wei
    BATTERIES & SUPERCAPS, 2022, 5 (11)
  • [36] Sustainable Thermoelectric Materials Predicted by Machine Learning
    Chernyavsky, Dmitry
    van den Brink, Jeroen
    Park, Gyu-Hyeon
    Nielsch, Kornelius
    Thomas, Andy
    ADVANCED THEORY AND SIMULATIONS, 2022, 5 (11)
  • [37] Opportunities and Challenges for Machine Learning in Materials Science
    Morgan, Dane
    Jacobs, Ryan
    ANNUAL REVIEW OF MATERIALS RESEARCH, VOL 50, 2020, 2020, 50 : 71 - 103
  • [38] A Critical Review of Machine Learning of Energy Materials
    Chen, Chi
    Zuo, Yunxing
    Ye, Weike
    Li, Xiangguo
    Deng, Zhi
    Ong, Shyue Ping
    ADVANCED ENERGY MATERIALS, 2020, 10 (08)
  • [39] Machine learning in materials genome initiative: A review
    Liu, Yingli
    Niu, Chen
    Wang, Zhuo
    Gan, Yong
    Zhu, Yan
    Sun, Shuhong
    Shen, Tao
    JOURNAL OF MATERIALS SCIENCE & TECHNOLOGY, 2020, 57 : 113 - 122
  • [40] Application of machine learning in magnetocaloric materials: A review
    Mo, Weiquan
    Wang, Jianfeng
    Yuan, Guoqing
    Cao, Dan
    Bai, Gongxun
    MATERIALS TODAY COMMUNICATIONS, 2025, 44