New materials band gap prediction based on the high-throughput calculation and the machine learning

被引:0
作者
Xu Y. [1 ]
Wang X. [2 ]
Li X. [2 ]
Xi L. [2 ]
Ni J. [1 ]
Zhu W. [1 ]
Zhang W. [1 ]
Yang J. [2 ]
机构
[1] School of Computer Engineering and Science, Shanghai University, Shanghai
[2] Materials Genome Institute of Shanghai University, Shanghai
来源
Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica | 2019年 / 49卷 / 01期
关键词
Bandgap; Component substitution approach; Diamond-like structures; Ensemble learning; Machine learning;
D O I
10.1360/N092018-00202
中图分类号
学科分类号
摘要
The bandgap often plays an important role in functional materials applications. For example, optoelectronic materials are generally wide bandgap semiconductors, while thermoelectric materials are narrow bandgap semiconductor materials. Therefore, predicting the bandgap rapidly and accurately for a given class of materials structures has great scientific importance for the functional materials applications. However, considering that the method of obtaining high-precision band gaps based on first-principles high-throughput calculations is time consuming and inefficient, and it is also not realistic to systematically measure a large number of material system band gaps. Machine learning methods based the statistics may be a promising alternative. This paper designs an ensemble learning model for effectively and accurately predicting bandgap values. Based on the calculated band gap values of diamond-like structures in thermoelectric materials, on the one hand, single component substitution strategy was used to generate large quantities of similar compounds, and the repetitive structures was filtered out by using the structural repeatability examination technique, resulting in 356 unique material structures. On the other hand, in combination with machine learning techniques, an efficient band gap prediction model was constructed, and by which the band gap values of 50 similar material systems are predicted and verified. As is the result of the experiment, this prediction model has 77.73% accuracy. It is enough robustness and stability to be widely used in thermoelectric materials application scenarios which require large band gap prediction. © 2019, Science Press. All right reserved.
引用
收藏
页码:44 / 54
页数:10
相关论文
共 50 条
[1]  
Goldsmid H.J., Introduction to Thermoelectricity, pp. 197-220, (2005)
[2]  
Snyder G.J., Toberer E.S., Complex thermoelectric materials, Nat Mater, 7, pp. 105-114, (2008)
[3]  
Bell L.E., Cooling, heating, generating power, and recovering waste heat with thermoelectric systems, Science, 321, pp. 1457-1461, (2008)
[4]  
Uher C., Semiconductors and Semimetals, pp. 139-253, (2001)
[5]  
Curtarolo S., Hart G.L.W., Nardelli M.B., Et al., The high-throughput highway to computational materials design, Nat Mater, 12, pp. 191-201, (2013)
[6]  
Sharma V., Wang C., Lorenzini R.G., Et al., Rational design of all organic polymer dielectrics, Nat Commun, 5, (2014)
[7]  
Ceder G., Hautier G., Jain A., Et al., Recharging lithium battery research with first-principles methods, MRS Bull, 36, pp. 185-191, (2011)
[8]  
Curtarolo S., Setyawan W., Wang S., Et al., AFLOWLIB.ORG: A Distributed Materials Properties Repository from High-Throughput Ab Initio Calculations, Comput Mater Sci, 58, pp. 227-235, (2012)
[9]  
Landis D.D., Hummelshoj J.S., Nestorov S., Et al., The computational materials repository, Comput Sci Eng, 14, pp. 51-57, (2012)
[10]  
Service R.F., Materials scientists look to a data-intensive future, Science, 335, pp. 1434-1435, (2012)