Iterative Methods with Self-Learning for Solving Nonlinear Equations

被引：0

作者：

Popkov, Yu. S. ^{[1
,2
]}

机构：

[1] Russian Acad Sci, Fed Res Ctr Comp Sci & Control, Moscow, Russia

[2] Russian Acad Sci, Trapeznikov Inst Control Sci, Moscow, Russia

来源：

AUTOMATION AND REMOTE CONTROL | 2024年 / 85卷 / 05期

关键词：

nonlinear equation; iterative methods; reinforcement; Monte Carlo experiment;

D O I：

10.1134/S0005117924050060

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper is devoted to the problem of solving a system of nonlinear equations with an arbitrary but continuous vector function on the left-hand side. By assumption, the values of its components are the only a priori information available about this function. An approximate solution of the system is determined using some iterative method with parameters, and the qualitative properties of the method are assessed in terms of a quadratic residual functional. We propose a self-learning (reinforcement) procedure based on auxiliary Monte Carlo (MC) experiments, an exponential utility function, and a payoff function that implements Bellman's optimality principle. A theorem on the strict monotonic decrease of the residual functional is proven.

引用

页码：472 / 476

页数：5

共 18 条

[11] Russel SJ., 2010, Artificial intelligence: a modern approach, V3rd
[12] Strekalovsky A.S., 2003, Elementy nevypukloi optimizatsii (Elements of Nonconvex Optimization)
[13] Sutton RS., 1998, INTRO REINFORCEMENT, DOI [10.1109/TNN.1998.712192, DOI 10.1109/TNN.1998.712192]
[14] van Hasselt H, 2016, AAAI CONF ARTIF INTE, P2094
[15] Wang C., 2022, P INT C LEARN REPR I
[16] Wasserman P., 1989, Neural Computing Theory and Practice
[17] WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698
[18] Wiering M, 2012, ADAPT LEARN OPTIM, V12, P1, DOI 10.1007/978-3-642-27645-3

← 1 2 →