New opportunities for materials informatics: Resources and data mining techniques for uncovering hidden relationships

Cited by: 184
Authors
Jain, Anubhav [1]
Hautier, Geoffroy [2]
Ong, Shyue Ping [3]
Persson, Kristin [1, 4]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Energy & Environm Technol Div, Berkeley, CA 94720 USA
[2] Catholic Univ Louvain, Inst Condensed Matter & Nanosci IMCN, B-1348 Louvain La Neuve, Belgium
[3] Univ Calif San Diego, Dept NanoEngn, La Jolla, CA 92093 USA
[4] Univ Calif Berkeley, Mat Sci & Engn, Berkeley, CA 94720 USA
Keywords
DENSITY-FUNCTIONAL THEORY; CRYSTAL-STRUCTURE; NEURAL-NETWORKS; OXIDE COMPOUNDS; DESIGN; CATHODES; INFRASTRUCTURE; SEMICONDUCTORS; PRINCIPLES; PREDICTION
DOI
10.1557/jmr.2016.80
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Data mining has revolutionized sectors as diverse as pharmaceutical drug discovery, finance, medicine, and marketing, and has the potential to similarly advance materials science. In this paper, we describe advances in simulation-based materials databases, open-source software tools, and machine learning algorithms that are converging to create new opportunities for materials informatics. We discuss the data mining techniques of exploratory data analysis, clustering, linear models, kernel ridge regression, tree-based regression, and recommendation engines. We present these techniques in the context of several materials application areas, including compound prediction, Li-ion battery design, piezoelectric materials, photocatalysts, and thermoelectric materials. Finally, we demonstrate how new data and tools are making it easier and more accessible than ever to perform data mining through a new analysis that learns trends in the valence and conduction band character of compounds in the Materials Project database using data on over 2500 compounds.
Pages: 977-994
Page count: 18