Stochastic searching on the line and its applications to parameter learning in nonlinear optimization

被引:58
作者
Oommen, BJ
机构
[1] School of Computer Science, Carleton University, Ottawa
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1997年 / 27卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/3477.604122
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of a learning mechanism (for example, a robot) locating a point on a line when it is interacting with a random environment which essentially informs it, possibly erroneously, which may it should move. In this paper we present a novel scheme by which the point can be learned using some recently devised learning principles. The heart of the strategy involves discretizing the space and performing a controlled random walk on this space. The scheme is shown to be epsilon-optimal and to converge with probability 1. Although the problem is solved in its generality, its application in nonlinear optimization has also been suggested. Typically, an optimization process involves working one's way toward the maximum (minimum) using the local information that is available. However, the crucial issue in these strategies is that of determining the parameter to be used in the optimization itself. If the parameter is too small the convergence is sluggish. On the other hand, if the parameter is too large, the system could erroneously converge or even oscillate. Our strategy can be used to determine the best parameter to be used in the optimization.
引用
收藏
页码:733 / 739
页数:7
相关论文
共 20 条
[1]  
[Anonymous], [No title captured]
[2]  
[Anonymous], 1986, NUMERICAL RECIPES C
[3]  
BAEZAYATES RA, 1992, P 1992 INT C CHIL CO, P269
[4]  
BAEZAYATES RA, 1988, P SCAND WORKSH ALG T, P176
[5]  
KARLIN S, 1974, 1 COURSE STOCHASTIC
[6]  
KASHYAP RL, 1983, IEEE T PATTERN ANAL, P667
[7]  
Lakshmivarahan S., 1981, Learning Algorithms, V1
[8]   DISCRETIZED ESTIMATOR LEARNING AUTOMATA [J].
LANCTOT, JK ;
OOMMEN, BJ .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (06) :1473-1483
[9]  
LANCTOT JK, 1989, THESIS CARLETON U OT
[10]  
NARENDRA K, 1989, LEARNING AUTOMATA IN