New opportunities for materials informatics: Resources and data mining techniques for uncovering hidden relationships

被引:185
作者
Jain, Anubhav [1 ]
Hautier, Geoffroy [2 ]
Ong, Shyue Ping [3 ]
Persson, Kristin [1 ,4 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Energy & Environm Technol Div, Berkeley, CA 94720 USA
[2] Catholic Univ Louvain, Inst Condensed Matter & Nanosci IMCN, B-1348 Louvain La Neuve, Belgium
[3] Univ Calif San Diego, Dept NanoEngn, La Jolla, CA 92093 USA
[4] Univ Calif Berkeley, Mat Sci & Engn, Berkeley, CA 94720 USA
关键词
DENSITY-FUNCTIONAL THEORY; CRYSTAL-STRUCTURE; NEURAL-NETWORKS; OXIDE COMPOUNDS; DESIGN; CATHODES; INFRASTRUCTURE; SEMICONDUCTORS; PRINCIPLES; PREDICTION;
D O I
10.1557/jmr.2016.80
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data mining has revolutionized sectors as diverse as pharmaceutical drug discovery, finance, medicine, and marketing, and has the potential to similarly advance materials science. In this paper, we describe advances in simulation-based materials databases, open-source software tools, and machine learning algorithms that are converging to create new opportunities for materials informatics. We discuss the data mining techniques of exploratory data analysis, clustering, linear models, kernel ridge regression, tree-based regression, and recommendation engines. We present these techniques in the context of several materials application areas, including compound prediction, Li-ion battery design, piezoelectric materials, photocatalysts, and thermoelectric materials. Finally, we demonstrate how new data and tools are making it easier and more accessible than ever to perform data mining through a new analysis that learns trends in the valence and conduction band character of compounds in the Materials Project database using data on over 2500 compounds.
引用
收藏
页码:977 / 994
页数:18
相关论文
共 127 条
[101]   Design of electroceramic materials using artificial neural networks and multiobjective evolutionary algorithms [J].
Scott, D. J. ;
Manos, S. ;
Coveney, P. V. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (02) :262-273
[102]   Prediction of Low-Thermal-Conductivity Compounds with First-Principles Anharmonic Lattice-Dynamics Calculations and Bayesian Optimization [J].
Seko, Atsuto ;
Togo, Atsushi ;
Hayashi, Hiroyuki ;
Tsuda, Koji ;
Chaput, Laurent ;
Tanaka, Isao .
PHYSICAL REVIEW LETTERS, 2015, 115 (20)
[103]   Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids [J].
Seko, Atsuto ;
Maekawa, Tomoya ;
Tsuda, Koji ;
Tanaka, Isao .
PHYSICAL REVIEW B, 2014, 89 (05)
[104]   Stereochemistry of vanadium in oxygen-containing compounds [J].
Serezhkin, V. N. ;
Urusov, V. S. .
JOURNAL OF STRUCTURAL CHEMISTRY, 2009, 50 (05) :861-866
[105]   REVISED EFFECTIVE IONIC-RADII AND SYSTEMATIC STUDIES OF INTERATOMIC DISTANCES IN HALIDES AND CHALCOGENIDES [J].
SHANNON, RD .
ACTA CRYSTALLOGRAPHICA SECTION A, 1976, 32 (SEP1) :751-767
[106]   p-type transparent conducting oxides [J].
Sheng, Su ;
Fang, Guojia ;
Li, Chun ;
Xu, Sheng ;
Zhao, Xingzhong .
PHYSICA STATUS SOLIDI A-APPLICATIONS AND MATERIALS SCIENCE, 2006, 203 (08) :1891-1900
[107]   Lithium Copper(I) Orthophosphates Li3-xCuxPO4: Synthesis, Crystal Structures, and Electrochemical Properties [J].
Snyder, Katharina ;
Raguz, Branimir ;
Hoffbauer, Wilfried ;
Glaum, Robert ;
Ehrenberg, Hartmut ;
Herklotz, Markus .
ZEITSCHRIFT FUR ANORGANISCHE UND ALLGEMEINE CHEMIE, 2014, 640 (05) :944-951
[108]   "Property Phase Diagrams" for Compound Semiconductors through Data Mining [J].
Srinivasan, Srikant ;
Rajan, Krishna .
MATERIALS, 2013, 6 (01) :279-290
[109]   Correcting density functional theory for accurate predictions of compound enthalpies of formation: Fitted elemental-phase reference energies [J].
Stevanovic, Vladan ;
Lany, Stephan ;
Zhang, Xiuwen ;
Zunger, Alex .
PHYSICAL REVIEW B, 2012, 85 (11)