Accurate property prediction with interpretable machine learning model for small datasets via transformed atom vector

被引:12
作者
Chen, Xinyu [1 ]
Lu, Shuaihua [1 ]
Wan, Xinyang [1 ]
Chen, Qian [1 ]
Zhou, Qionghua [1 ]
Wang, Jinlan [1 ,2 ]
机构
[1] Southeast Univ, Sch Phys, Nanjing 211189, Peoples R China
[2] Suzhou Lab, Suzhou 215125, Peoples R China
关键词
NETWORKS;
D O I
10.1103/PhysRevMaterials.6.123803
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Machine learning techniques can greatly accelerate material discovery while high-dimensional representation often causes overfitting problems and leads to poor model performance. Building a structure-property relation-ship with low-dimensional representation is always an open challenge, especially for diverse structures within small datasets. To address this issue, a low-dimensional representation named the transformed atom vector (TAV) is proposed, which is a crystal-graph-based descriptor. As an example, we apply it in two-dimensional materials and predict the band gap at the Heyd-Scuseria-Ernzerhof level with only 500 samples at acceptable accuracy. Moreover, TAV representation retains interpretability, based on which a property-oriented search method through element substitution is developed. This work provides a universal low-dimensional representation containing rich material information, as well as an intuitive interpretation approach for material design, which improves the feasibility and interpretability of machine learning models for small datasets and helps realize accurate yet meaningful property prediction at a lower cost.
引用
收藏
页数:8
相关论文
共 60 条
[1]   Machine Learning Study of the Magnetic Ordering in 2D Materials [J].
Acosta, Carlos Mera ;
Ogoshi, Elton ;
Souza, Jose Antonio ;
Dalpian, Gustavo M. .
ACS APPLIED MATERIALS & INTERFACES, 2022, 14 (07) :9418-9432
[2]   Topology-Scaling Identification of Layered Solids and Stable Exfoliated 2D Materials [J].
Ashton, Michael ;
Paul, Joshua ;
Sinnott, Susan B. ;
Hennig, Richard G. .
PHYSICAL REVIEW LETTERS, 2017, 118 (10)
[3]   On representing chemical environments [J].
Bartok, Albert P. ;
Kondor, Risi ;
Csanyi, Gabor .
PHYSICAL REVIEW B, 2013, 87 (18)
[4]   Machine learning for molecular and materials science [J].
Butler, Keith T. ;
Davies, Daniel W. ;
Cartwright, Hugh ;
Isayev, Olexandr ;
Walsh, Aron .
NATURE, 2018, 559 (7715) :547-555
[5]   Solving the quantum many-body problem with artificial neural networks [J].
Carleo, Giuseppe ;
Troyer, Matthias .
SCIENCE, 2017, 355 (6325) :602-605
[6]   Learning properties of ordered and disordered materials from multi-fidelity data [J].
Chen, Chi ;
Zuo, Yunxing ;
Ye, Weike ;
Li, Xiangguo ;
Ong, Shyue Ping .
NATURE COMPUTATIONAL SCIENCE, 2021, 1 (01) :46-+
[7]   Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals [J].
Chen, Chi ;
Ye, Weike ;
Zuo, Yunxing ;
Zheng, Chen ;
Ong, Shyue Ping .
CHEMISTRY OF MATERIALS, 2019, 31 (09) :3564-3572
[8]  
Chen Tianqi, 2016, 22 ACM SIGKDD INT C
[9]   A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds [J].
de Jong, Maarten ;
Chen, Wei ;
Notestine, Randy ;
Persson, Kristin ;
Ceder, Gerbrand ;
Jain, Anubhav ;
Asta, Mark ;
Gamst, Anthony .
SCIENTIFIC REPORTS, 2016, 6
[10]  
Dzeroski S, 2004, MACH LEARN, V54, P255, DOI 10.1023/B.MAC.0000015881.36452.6e