共 16 条
- [1] Acosta-Abreu R. S., 1985, Control and Cybernetics, V14, P313
- [2] BARANOV VV, 1981, CYBERNETICS+, V17, P815
- [7] HINDERER K, 1970, F NONSTATIONARY DYNA
- [9] HUBNER G, 1977, DYNAMISCHE OPTIMIERU, V98, P57
- [10] LEARNING ALGORITHMS FOR MARKOV DECISION-PROCESSES [J]. JOURNAL OF APPLIED PROBABILITY, 1987, 24 (01) : 270 - 276