Transfer learning of recurrent neural network-based plasticity models

被引:16
作者
Heidenreich, Julian N. [1 ,2 ]
Bonatti, Colin [1 ]
Mohr, Dirk [1 ,2 ]
机构
[1] ETH, Dept Mech & Proc Engn, Chair Artificial Intelligence Mech & Mfg, Zurich, Switzerland
[2] MIT, Dept Mech Engn, Impact & Crashworthiness Lab, Cambridge, MA USA
关键词
artificial intelligence; plasticity; recurrent neural network; transfer learning; GLOBAL OPTIMIZATION;
D O I
10.1002/nme.7357
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Mechanics-specific recurrent neural network (RNN) models are known for their ability to describe the complex three-dimensional stress-strain response of elasto-plastic solids for arbitrary loading paths. To apply RNN models to real materials, it is crucial to identify a strategy that allows for their training from small datasets that could be obtained from robot-assisted experiments. It is demonstrated that regular training with datasets comprising random walks (RWs) in strain space yield a significantly higher generalization ability than the same number of sequences for smooth loading paths. Moreover, it is found that transfer learning, that is, initializing the weights and biases with the parameters from an already trained material, improves the convergence rates and reduces the required number of stress-strain sequences for training. When leveraging the experience gained for multiple materials through ensemble transfer learning, even more substantial improvements are obtained. For example, the same model accuracy and generalization ability is obtained from training with 400 smooth stress-strain sequences after ensemble transfer as from training with 10,000 RW sequences after regular training.
引用
收藏
页数:31
相关论文
共 65 条
[1]   Deep learning for plasticity and thermo-viscoplasticity [J].
Abueidda, Diab W. ;
Koric, Seid ;
Sobh, Nahil A. ;
Sehitoglu, Huseyin .
INTERNATIONAL JOURNAL OF PLASTICITY, 2021, 136
[2]   Prediction of nonlinear viscoelastic behavior of polymeric composites using an artificial neural network [J].
Al-Haik, MS ;
Hussaini, MY ;
Garmestani, H .
INTERNATIONAL JOURNAL OF PLASTICITY, 2006, 22 (07) :1367-1392
[3]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[4]   Elasto-viscoplastic constitutive equations for polycrystalline fcc materials at low homologous temperatures [J].
Balasubramanian, S ;
Anand, L .
JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2002, 50 (01) :101-126
[5]   An alternative to kinematic hardening in classical plasticity [J].
Barlat, Frederic ;
Gracio, Jose J. ;
Lee, Myoung-Gyu ;
Rauch, Edgar F. ;
Vincze, Gabriela .
INTERNATIONAL JOURNAL OF PLASTICITY, 2011, 27 (09) :1309-1327
[6]  
Bennett PN., 2003, P ICML WORKSH CONT L
[7]  
Blitzer J., 2007, BIOGRAPHIES BOLLYWOO, P432
[8]  
Blitzer JC, 2006, Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, P120, DOI DOI 10.3115/1610075.1610094
[9]   On the importance of self-consistency in recurrent neural network models representing elasto-plastic solids [J].
Bonatti, Colin ;
Mohr, Dirk .
JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2022, 158
[10]   One for all: Universal material model based on minimal state-space neural networks [J].
Bonatti, Colin ;
Mohr, Dirk .
SCIENCE ADVANCES, 2021, 7 (26)