Crystal synthesizability prediction using contrastive positive unlabeled learning

被引:0
作者
Sun, Tao [1 ,3 ]
Yuan, Jianmei [1 ,2 ,3 ]
机构
[1] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan 411105, Hunan, Peoples R China
[2] Xiangtan Univ, Minist Educ, Key Lab Intelligent Comp & Informat Proc, Xiangtan 411105, Hunan, Peoples R China
[3] Natl Ctr Appl Math Hunan, Xiangtan 411105, Hunan, Peoples R China
关键词
Perovskite materials; Photovoltaic applications; Contrastive learning; Positive unlabeled learning; PEROVSKITE; ALGORITHM;
D O I
10.1016/j.cpc.2024.109465
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-throughput screening or generative models rapidly identify crystal structures with the desired properties, but the synthesizable ratio is generally low. Experimentally verifying the synthesizability of individual virtual crystals would entail significant time and resources. Therefore, a method for automatically assessing the synthesizability of virtual crystals is urgently needed. This paper describes an approach that combines contrastive learning and positive unlabeled learning. The resulting contrastive positive unlabeled learning (CPUL) model predicts the crystal-likeness score (CLscore) of virtual materials. The model achieves a true positive (CLscore > 0.5) prediction accuracy of 93.95% on a test set containing 10,000 materials taken from the Materials Project (MP) database. We further validate the model by using all Fe-containing materials from the MP database as the test set, obtaining a true positive rate of 88.89%. This indicates that the CPUL model performs well, even with limited knowledge of the interactions between Fe and the atoms in the crystals. The CPUL model is then used to assess the CLscore of virtual crystals in the MP database and analyze their synthesizability by combining the energy above the hull. Finally, the synthesizability of perovskite materials is predicted using the proposed CPUL model, resulting in seven candidate halide perovskite materials for photovoltaic applications.
引用
收藏
页数:9
相关论文
共 44 条
[1]   Predicting the synthesizability of crystalline inorganic materials from the data of known material compositions [J].
Antoniuk, Evan R. ;
Cheon, Gowoon ;
Wang, George ;
Bernstein, Daniel ;
Cai, William ;
Reed, Evan J. .
NPJ COMPUTATIONAL MATERIALS, 2023, 9 (01)
[2]   Modified global k-means algorithm for minimum sum-of-squares clustering problems [J].
Bagirov, Adil M. .
PATTERN RECOGNITION, 2008, 41 (10) :3192-3199
[3]   High throughput calculations for a dataset of bilayer materials [J].
Barik, Ranjan Kumar ;
Woods, Lilia M. .
SCIENTIFIC DATA, 2023, 10 (01)
[4]  
Chen T, 2020, PR MACH LEARN RES, V119
[5]   AFLOW: An automatic framework for high-throughput materials discovery [J].
Curtarolo, Stefano ;
Setyawan, Wahyu ;
Hart, Gus L. W. ;
Jahnatek, Michal ;
Chepulskii, Roman V. ;
Taylor, Richard H. ;
Wanga, Shidong ;
Xue, Junkai ;
Yang, Kesong ;
Levy, Ohad ;
Mehl, Michael J. ;
Stokes, Harold T. ;
Demchenko, Denis O. ;
Morgan, Dane .
COMPUTATIONAL MATERIALS SCIENCE, 2012, 58 :218-226
[6]  
Datta J., 2024, PREPRINT
[7]   Predicting synthesizability of crystalline materials via deep learning [J].
Davariashtiyani, Ali ;
Kadkhodaie, Zahra ;
Kadkhodaei, Sara .
COMMUNICATIONS MATERIALS, 2021, 2 (01)
[8]   High-Throughput Strategies in the Discovery of Thermoelectric Materials [J].
Deng, Tingting ;
Qiu, Pengfei ;
Yin, Tingwei ;
Li, Ze ;
Yang, Jiong ;
Wei, Tianran ;
Shi, Xun .
ADVANCED MATERIALS, 2024, 36 (13)
[9]   Prediction of Synthesis of 2D Metal Carbides and Nitrides (MXenes) and Their Precursors with Positive and Unlabeled Machine Learning [J].
Frey, Nathan C. ;
Wang, Jin ;
Bellido, Gabriel Ivan Vega ;
Anasori, Babak ;
Gogotsi, Yury ;
Shenoy, Vivek B. .
ACS NANO, 2019, 13 (03) :3031-3041
[10]   Robust Design of High-Performance Optoelectronic ChalcogenideCrystals from High-Throughput ComputationYu [J].
Gan, Yu ;
Miao, Naihua ;
Lan, Penghua ;
Zhou, Jian ;
Elliott, Stephen R. ;
Sun, Zhimei .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (13) :5878-5886