Prediction of Organic-Inorganic Hybrid Perovskite Band Gap by Multiple Machine Learning Algorithms

被引:7
作者
Feng, Shun [1 ]
Wang, Juan [1 ,2 ]
机构
[1] Xijing Univ, Sch Elect Informat, Xian Key Lab Adv Photoelect Mat & Energy Convers D, Xian 710123, Peoples R China
[2] Xijing Univ, Shaanxi Engn Res Ctr Controllable Neutron Source, Sch Elect Informat, Xian 710123, Peoples R China
关键词
organic-inorganic hybrid perovskite; band gap prediction; machine learning model; XGBoost algorithm;
D O I
10.3390/molecules29020499
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As an indicator of the optical characteristics of perovskite materials, the band gap is a crucial parameter that impacts the functionality of a wide range of optoelectronic devices. Obtaining the band gap of a material via a labor-intensive, time-consuming, and inefficient high-throughput calculation based on first principles is possible. However, it does not yield the most accurate results. Machine learning techniques emerge as a viable and effective substitute for conventional approaches in band gap prediction. This paper collected 201 pieces of data through the literature and open-source databases. By separating the features related to bits A, B, and X, a dataset of 1208 pieces of data containing 30 feature descriptors was established. The dataset underwent preprocessing, and the Pearson correlation coefficient method was employed to eliminate non-essential features as a subset of features. The band gap was predicted using the GBR algorithm, the random forest algorithm, the LightGBM algorithm, and the XGBoost algorithm, in that order, to construct a prediction model for organic-inorganic hybrid perovskites. The outcomes demonstrate that the XGBoost algorithm yielded an MAE value of 0.0901, an MSE value of 0.0173, and an R2 value of 0.991310. These values suggest that, compared to the other two models, the XGBoost model exhibits the lowest prediction error, suggesting that the input features may better fit the prediction model. Finally, analysis of the XGBoost-based prediction model's prediction results using the SHAP model interpretation method reveals that the occupancy rate of the A-position ion has the greatest impact on the prediction of the band gap and has an A-negative correlation with the prediction results of the band gap. The findings provide valuable insights into the relationship between the prediction of band gaps and significant characteristics of organic-inorganic hybrid perovskites.
引用
收藏
页数:18
相关论文
共 26 条
[1]   Explaining anomalies detected by autoencoders using Shapley Additive Explanations [J].
Antwarg, Liat ;
Miller, Ronnie Mindlin ;
Shapira, Bracha ;
Rokach, Lior .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
[2]   Gradient boosting survival tree with applications in credit scoring [J].
Bai, Miaojun ;
Zheng, Yan ;
Shen, Yun .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2022, 73 (01) :39-55
[3]   Root mean square error (RMSE) or mean absolute error (MAE)? - Arguments against avoiding RMSE in the literature [J].
Chai, T. ;
Draxler, R. R. .
GEOSCIENTIFIC MODEL DEVELOPMENT, 2014, 7 (03) :1247-1250
[4]   Δ-Machine learning-driven discovery of double hybrid organic-inorganic perovskites [J].
Chen, Jialu ;
Xu, Wenjun ;
Zhang, Ruiqin .
JOURNAL OF MATERIALS CHEMISTRY A, 2022, 10 (03) :1402-1413
[5]  
Cutler A, 2012, ENSEMBLE MACHINE LEARNING: METHODS AND APPLICATIONS, P157, DOI 10.1007/978-1-4419-9326-7_5
[6]   Screening for lead-free inorganic double perovskites with suitable band gaps and high stability using combined machine learning and DFT calculation [J].
Gao, Zhengyang ;
Zhang, Hanwen ;
Mao, Guangyang ;
Ren, Jianuo ;
Chen, Ziheng ;
Wu, Chongchong ;
Gates, Ian D. ;
Yang, Weijie ;
Ding, Xunlei ;
Yao, Jianxi .
APPLIED SURFACE SCIENCE, 2021, 568
[7]  
Ke GL, 2017, ADV NEUR IN, V30
[8]   Feature Selection: A Data Perspective [J].
Li, Jundong ;
Cheng, Kewei ;
Wang, Suhang ;
Morstatter, Fred ;
Trevino, Robert P. ;
Tang, Jiliang ;
Liu, Huan .
ACM COMPUTING SURVEYS, 2018, 50 (06)
[9]   Design of Organic-Inorganic Hybrid Heterostructured Semiconductors via High-Throughput Materials Screening for Optoelectronic Applications [J].
Li, Yawen ;
Yang, Jingxiu ;
Zhao, Ruoting ;
Zhang, Yilin ;
Wang, Xinjiang ;
He, Xin ;
Fu, Yuhao ;
Zhang, Lijun .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (36) :16656-16666
[10]   Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning [J].
Lu, Shuaihua ;
Zhou, Qionghua ;
Ouyang, Yixin ;
Guo, Yilv ;
Li, Qiang ;
Wang, Jinlan .
NATURE COMMUNICATIONS, 2018, 9