Reinforcement learning and approximate Bayesian computation for model selection and parameter calibration applied to a nonlinear

被引:11
作者
Ritto, T. G. [1 ,2 ]
Beregi, S. [2 ]
Barton, D. A. W. [2 ]
机构
[1] Univ Fed Rio de Janeiro, Dept Mech Engn, Rio De Janeiro, Brazil
[2] Univ Bristol, Fac Engn, Bristol, England
基金
英国工程与自然科学研究理事会;
关键词
Nonlinear dynamics; Parameter identification; Model selection; Reinforcement learning; ABC; Decision under uncertainty; MONTE-CARLO; ROBOTICS;
D O I
10.1016/j.ymssp.2022.109485
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
In the context of digital twins and integration of physics-based models with machine learning tools, this paper proposes a new methodology for model selection and parameter identification. It combines (i) reinforcement learning (RL) for model selection through a Thompson-like sampling with (ii) approximate Bayesian computation (ABC) for parameter identification and uncertainty quantification. These two methods are applied together to a nonlinear mechanical oscillator with periodic forcing. Experimental data are used in the analysis and two different nonlinear models are tested. The initial Beta distribution that represents the likelihood of the model is updated depending on how successful the model is at reproducing the reference data (reinforcement learning strategy). At the same time, the prior distribution of the model parameters is updated using a likelihood-free strategy (ABC). In the end, the rewards and the posterior distribution of the parameters of each model are obtained. The results show that the combined methodology (RL-ABC) is promising for model selection from bifurcation diagrams. Prior parameter distribution was successfully updated, correlations between parameters were found, probabilistic envelopes of the posterior model are consistent with the available data, the most rewarded model was selected, and the reinforcing strategy allows to speed up the selection process.
引用
收藏
页数:15
相关论文
共 37 条
  • [1] Hierarchical fault classification for resource constrained systems
    Adams, Stephen
    Meekins, Ryan
    Beling, Peter A.
    Farinholt, Kevin
    Brown, Nathan
    Polter, Sherwood
    Dong, Qing
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 134
  • [2] [Anonymous], 2013, 30 INT C MACHINE LEA
  • [3] Numerical Continuation in a Physical Experiment: Investigation of a Nonlinear Energy Harvester
    Barton, David A. W.
    Burrow, Stephen G.
    [J]. JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS, 2011, 6 (01):
  • [4] Approximate Bayesian Computation in Evolution and Ecology
    Beaumont, Mark A.
    [J]. ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS, VOL 41, 2010, 41 : 379 - 406
  • [5] Model selection using response measurements: Bayesian probabilistic approach
    Beck, JL
    Yuen, KV
    [J]. JOURNAL OF ENGINEERING MECHANICS-ASCE, 2004, 130 (02): : 192 - 203
  • [6] Model selection and parameter estimation in structural dynamics using approximate Bayesian computation
    Ben Abdessalem, Anis
    Dervilis, Nikolaos
    Wagg, David
    Worden, Keith
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 99 : 306 - 325
  • [7] Robustness of nonlinear parameter identification in the presence of process noise using control-based continuation
    Beregi, Sandor
    Barton, David A. W.
    Rezgui, Djamel
    Neild, Simon A.
    [J]. NONLINEAR DYNAMICS, 2021, 104 (02) : 885 - 900
  • [8] Bowman A. W., 1997, Applied Smoothing Techniques for Data Analysis: the Kernel Approach with S-Plus Illustrations, V18
  • [9] Discovering governing equations from data by sparse identification of nonlinear dynamical systems
    Brunton, Steven L.
    Proctor, Joshua L.
    Kutz, J. Nathan
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) : 3932 - 3937
  • [10] Improved delivery policies for future drone-based delivery systems
    Chen, Heng
    Hu, Zhangchen
    Solak, Senay
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 294 (03) : 1181 - 1201