Interpreting and generalizing deep learning in physics-based problems with functional linear models

被引:2
作者
Arzani, Amirhossein [1 ,2 ]
Yuan, Lingxiao [3 ]
Newell, Pania [1 ]
Wang, Bei [2 ,4 ]
机构
[1] Univ Utah, Dept Mech Engn, Salt Lake City, UT 84112 USA
[2] Univ Utah, Sci Comp & Imaging Inst, Salt Lake City, UT 84112 USA
[3] Boston Univ, Dept Mech Engn, Boston, MA USA
[4] Univ Utah, Sch Comp, Salt Lake City, UT USA
关键词
Explainable artificial intelligence (XAI); Scientific machine learning; Functional data analysis; Operator learning; Generalization; BANDWIDTH SELECTION;
D O I
10.1007/s00366-024-01987-z
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Although deep learning has achieved remarkable success in various scientific machine learning applications, its opaque nature poses concerns regarding interpretability and generalization capabilities beyond the training data. Interpretability is crucial and often desired in modeling physical systems. Moreover, acquiring extensive datasets that encompass the entire range of input features is challenging in many physics-based learning tasks, leading to increased errors when encountering out-of-distribution (OOD) data. In this work, motivated by the field of functional data analysis (FDA), we propose generalized functional linear models as an interpretable surrogate for a trained deep learning model. We demonstrate that our model could be trained either based on a trained neural network (post-hoc interpretation) or directly from training data (interpretable operator learning). A library of generalized functional linear models with different kernel functions is considered and sparse regression is used to discover an interpretable surrogate model that could be analytically presented. We present test cases in solid mechanics, fluid mechanics, and transport. Our results demonstrate that our model can achieve comparable accuracy to deep learning and can improve OOD generalization while providing more transparency and interpretability. Our study underscores the significance of interpretable representation in scientific machine learning and showcases the potential of functional linear models as a tool for interpreting and generalizing deep learning.
引用
收藏
页码:135 / 157
页数:23
相关论文
共 92 条
[11]   OPTIMAL MINIMAL NEURAL INTERPRETATION OF SPECTRA [J].
BORGGAARD, C ;
THODBERG, HH .
ANALYTICAL CHEMISTRY, 1992, 64 (05) :545-551
[12]   Data-driven discovery of Green's functions with human-understandable deep learning [J].
Boulle, Nicolas ;
Earls, Christopher J. ;
Townsend, Alex .
SCIENTIFIC REPORTS, 2022, 12 (01)
[13]  
Boullé N, 2022, J MACH LEARN RES, V23
[14]   Compressive sensing based machine learning strategy for characterizing the flow around a cylinder with limited pressure measurements [J].
Bright, Ido ;
Lin, Guang ;
Kutz, J. Nathan .
PHYSICS OF FLUIDS, 2013, 25 (12)
[15]   Discovering governing equations from data by sparse identification of nonlinear dynamical systems [J].
Brunton, Steven L. ;
Proctor, Joshua L. ;
Kutz, J. Nathan .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) :3932-3937
[16]   Applied Koopmanism [J].
Budisic, Marko ;
Mohr, Ryan ;
Mezic, Igor .
CHAOS, 2012, 22 (04)
[17]   Data-driven discovery of coordinates and governing equations [J].
Champion, Kathleen ;
Lusch, Bethany ;
Kutz, J. Nathan ;
Brunton, Steven L. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (45) :22445-22451
[18]  
Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
[19]  
Cremades A, 2023, ARXIV
[20]   Comparing different nonlinear dimensionality reduction techniques for data-driven unsteady fluid flow modeling [J].
Csala, Hunor ;
Dawson, Scott T. M. ;
Arzani, Amirhossein .
PHYSICS OF FLUIDS, 2022, 34 (11)