ANALYSIS OF AN ADAPTIVE-CONTROL SCHEME FOR A PARTIALLY OBSERVED CONTROLLED MARKOV-CHAIN

被引：23

作者：

FERNANDEZGAUCHERAND, E

ARAPOSTATHIS, A

MARCUS, SI

机构：

[1] UNIV TEXAS, DEPT ELECT & COMP ENGN, AUSTIN, TX 78712 USA

[2] UNIV MARYLAND, DEPT ELECT ENGN, COLL PK, MD 20742 USA

[3] UNIV MARYLAND, SYST RES CTR, COLL PK, MD 20742 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 1993年 / 38卷 / 06期

关键词：

D O I：

10.1109/9.222316

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider an adaptive finite state controlled Markov chain with partial state information, motivated by a class of replacement problems. We present parameter estimation techniques based on the information available after actions that reset the state to a known value are taken. We prove that the parameter estimates converge w.p.1 to the true (unknown) parameter, under the feedback structure induced by a certainty equivalent adaptive policy. We also show that the adaptive policy is self-optimizing in a long-run average sense, for any (measurable) sequence of parameter estimates converging w.p.1 to the true parameter.

引用

页码：987 / 993

页数：7

共 18 条

[1] [Anonymous], 1978, STOCHASTIC APPROXIMA
[2] ANALYSIS OF AN IDENTIFICATION ALGORITHM ARISING IN THE ADAPTIVE ESTIMATION OF MARKOV-CHAINS
ARAPOSTATHIS, A
MARCUS, SI
[J]. MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 1990, 3 (01) : 1 - 29
[3] ARAPOSTATHIS A, UNPUB DISCRETE TIME
[4] ARAPOSTATHIS A, 1990, 29TH P IEEE C DEC CO, P1438
[5] OPTIMAL CONTROL OF MARKOV PROCESSES WITH INCOMPLETE STATE INFORMATION
ASTROM, KJ
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1965, 10 (01) : 174 - &
[6] Bertsekas D.P., 1987, ABSTRACT DYNAMIC PRO
[7] Fernandez-Gaucherand E., 1991, Annals of Operations Research, V29, P439, DOI 10.1007/BF02283610
[8] FERNANDEZGAUCHE.E, 1988, 27TH P IEEE C DEC CO, P1204
[9] FERNANDEZGAUCHE.E, 1991, THESIS U TEXAS AUSTI
[10] Fernandezgaucherand E., 1989, LECT NOTES CONTR INF, V130, P217

← 1 2 →