Reinforcement learning and approximate Bayesian computation for model selection and parameter calibration applied to a nonlinear

Cited by: 11

Authors:

Ritto, T. G. [1, 2]

Beregi, S. [2]

Barton, D. A. W. [2]

Affiliations:

[1] Univ Fed Rio de Janeiro, Dept Mech Engn, Rio De Janeiro, Brazil

[2] Univ Bristol, Fac Engn, Bristol, England

Source:

MECHANICAL SYSTEMS AND SIGNAL PROCESSING | 2022 / Vol. 181

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Nonlinear dynamics; Parameter identification; Model selection; Reinforcement learning; ABC; Decision under uncertainty; MONTE-CARLO; ROBOTICS;

DOI:

10.1016/j.ymssp.2022.109485

Chinese Library Classification (CLC):

TH [Machinery and Instrument Industry];

Discipline classification code:

0802;

Abstract:

In the context of digital twins and the integration of physics-based models with machine learning tools, this paper proposes a new methodology for model selection and parameter identification. It combines (i) reinforcement learning (RL) for model selection through a Thompson-like sampling with (ii) approximate Bayesian computation (ABC) for parameter identification and uncertainty quantification. The two methods are applied together to a nonlinear mechanical oscillator with periodic forcing. Experimental data are used in the analysis, and two different nonlinear models are tested. The initial Beta distribution that represents the likelihood of each model is updated depending on how successful the model is at reproducing the reference data (reinforcement learning strategy). At the same time, the prior distribution of the model parameters is updated using a likelihood-free strategy (ABC). In the end, the rewards and the posterior distribution of the parameters of each model are obtained. The results show that the combined methodology (RL-ABC) is promising for model selection from bifurcation diagrams. The prior parameter distribution was successfully updated, correlations between parameters were found, the probabilistic envelopes of the posterior model are consistent with the available data, the most rewarded model was selected, and the reinforcing strategy speeds up the selection process.
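The loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the two toy models (a linear and a cubic force-displacement curve), the uniform priors, the RMS distance, the tolerance `eps`, and the function name `rl_abc` are all assumptions made for illustration. The paper itself works with experimental bifurcation diagrams of a forced nonlinear oscillator. The sketch only shows how a Beta distribution per model can drive a Thompson-like model choice while ABC rejection collects posterior parameter samples.

```python
import numpy as np

def rl_abc(models, priors, x, y_ref, eps, n_iter=3000, seed=0):
    """Toy RL-ABC loop: Thompson-like model choice plus ABC rejection."""
    rng = np.random.default_rng(seed)
    n_models = len(models)
    alpha = np.ones(n_models)  # Beta(1, 1) start: no initial preference
    beta = np.ones(n_models)
    accepted = [[] for _ in range(n_models)]
    for _ in range(n_iter):
        # Thompson-like step: sample one success probability per model
        # and test the model with the largest draw.
        k = int(np.argmax(rng.beta(alpha, beta)))
        theta = priors[k](rng)  # candidate parameters drawn from the prior
        dist = np.sqrt(np.mean((models[k](x, theta) - y_ref) ** 2))
        if dist < eps:
            # ABC acceptance doubles as the reward for model k.
            alpha[k] += 1
            accepted[k].append(theta)
        else:
            beta[k] += 1
    return alpha, beta, [np.array(a) for a in accepted]

# Hypothetical reference data: cubic restoring force with small noise.
x = np.linspace(0.0, 1.0, 20)
noise = np.random.default_rng(1).normal(0.0, 0.02, x.size)
y_ref = 1.0 * x + 2.0 * x**3 + noise

# Two hypothetical candidate models and their uniform priors.
models = [
    lambda x, th: th[0] * x,                   # model 0: linear
    lambda x, th: th[0] * x + th[1] * x**3,    # model 1: cubic
]
priors = [
    lambda rng: rng.uniform(0.0, 3.0, 1),
    lambda rng: rng.uniform(0.0, 3.0, 2),
]

alpha, beta, posts = rl_abc(models, priors, x, y_ref, eps=0.1)
print("Beta parameters per model:", alpha, beta)
print("accepted samples for the cubic model:", len(posts[1]))
```

In this toy setting the linear model cannot reach the tolerance, so only its failure count grows and the sampler spends most iterations on the cubic model. The accepted samples in `posts[1]` approximate the posterior of the two stiffness-like parameters, and their sample correlation can be inspected with `np.corrcoef(posts[1].T)`.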


Pages: 15

