Pairwise Difference Regression: A Machine Learning Meta-algorithm for Improved Prediction and Uncertainty Quantification in Chemical Search
Cited by: 23
Authors:
Tynes, Michael [3, 4]
Gao, Wenhao [1, 2]
Burrill, Daniel J. [3, 4]
Batista, Enrique R. [3, 4]
Perez, Danny [3]
Yang, Ping [3]
Lubbers, Nicholas [1]
Affiliations:
[1] Los Alamos Natl Lab, Comp Computat & Stat Sci Div, Comp, Los Alamos, NM 87545 USA
[2] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
[3] Los Alamos Natl Lab, Theoret Div, Los Alamos, NM 87545 USA
[4] Los Alamos Natl Lab, Ctr Nonlinear Studies, Los Alamos, NM 87545 USA
Machine learning (ML) plays a growing role in the design and discovery of chemicals, aiming to reduce the need to perform expensive experiments and simulations. ML for such applications is promising but difficult, as models must generalize to vast chemical spaces from small training sets and must have reliable uncertainty quantification metrics to identify and prioritize unexplored regions. Ab initio computational chemistry and chemical intuition alike often take advantage of differences between chemical conditions, rather than their absolute structure or state, to generate more reliable results. We have developed an analogous comparison-based approach for ML regression, called pairwise difference regression (PADRE), which is applicable to arbitrary underlying learning models and operates on pairs of input data points. During training, the model learns to predict differences between all possible pairs of input points. During prediction, the test points are paired with all training set points, giving rise to a set of predictions that can be treated as a distribution of which the mean is treated as a final prediction and the dispersion is treated as an uncertainty measure. Pairwise difference regression was shown to reliably improve the performance of the random forest algorithm across five chemical ML tasks. Additionally, the pair-derived dispersion is both well correlated with model error and performs well in active learning. We also show that this method is competitive with state-of-the-art neural network techniques. Thus, pairwise difference regression is a promising tool for candidate selection algorithms used in chemical discovery.
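The training and prediction steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the small k-nearest-neighbour base learner (standing in for the random forest the abstract reports, so that the example needs only the standard library), the pair featurization in `pair_features`, the use of all ordered pairs including self-pairs, and the population standard deviation as the dispersion measure. The class and function names are hypothetical.

```python
import itertools
import math
import statistics


class KNNRegressor:
    """Tiny k-nearest-neighbour regressor; a stand-in for any base learner."""

    def __init__(self, k=3):
        self.k = k

    def fit(self, X, y):
        self.X, self.y = list(X), list(y)
        return self

    def predict_one(self, x):
        # Indices of the k stored points closest to x (Euclidean distance).
        order = sorted(range(len(self.X)), key=lambda i: math.dist(self.X[i], x))
        return statistics.fmean(self.y[i] for i in order[: self.k])


def pair_features(a, b):
    # Assumed featurization: both points concatenated with their difference.
    return tuple(a) + tuple(b) + tuple(ai - bi for ai, bi in zip(a, b))


class PairwiseDifferenceRegressor:
    """Wraps a base learner so that it learns differences between pairs."""

    def __init__(self, base):
        self.base = base

    def fit(self, X, y):
        self.X, self.y = [tuple(x) for x in X], list(y)
        idx = range(len(self.X))
        # Assumption: "all possible pairs" means all ordered pairs (i, j).
        pairs = list(itertools.product(idx, idx))
        self.base.fit(
            [pair_features(self.X[i], self.X[j]) for i, j in pairs],
            [self.y[i] - self.y[j] for i, j in pairs],
        )
        return self

    def predict(self, x):
        # Pair the query with every training point. Each pair gives one
        # estimate: the known y_j plus the predicted difference y(x) - y_j.
        estimates = [
            yj + self.base.predict_one(pair_features(x, xj))
            for xj, yj in zip(self.X, self.y)
        ]
        # Mean is the final prediction; spread is the uncertainty measure.
        return statistics.fmean(estimates), statistics.pstdev(estimates)


# Toy usage on y = 2x with ten one-dimensional training points.
X_train = [(float(i),) for i in range(10)]
y_train = [2.0 * i for i in range(10)]
model = PairwiseDifferenceRegressor(KNNRegressor(k=3)).fit(X_train, y_train)
prediction, uncertainty = model.predict((4.5,))
```

Because the wrapper only calls `fit` and `predict_one` on the base learner, any regressor exposing those two methods could be substituted. Note that the number of training pairs grows quadratically with the training set size, which matters for anything beyond small data sets.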