共 63 条
- [51] Puterman M., 1994, MARKOV DECISION PROC
- [52] Rummery G. A., 1994, CUEDFINFENGTR166
- [53] LEARNING APPLIED TO SUCCESSIVE APPROXIMATION ALGORITHMS [J]. IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1970, SSC6 (02): : 97 - &
- [54] Singh SP, 1996, MACH LEARN, V22, P123, DOI 10.1007/BF00114726
- [55] Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479
- [56] Sutton R.S., 1990, P 7 INT C MACHINE LE, P216
- [57] Sutton R.S., 1984, Temporal Credit Assignment in Reinforcement Learning
- [58] SUTTON RS, 1996, ADV NEURAL INFORMATI, V8
- [59] FUZZY IDENTIFICATION OF SYSTEMS AND ITS APPLICATIONS TO MODELING AND CONTROL [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (01): : 116 - 132
- [60] TESAURO G, 1992, MACH LEARN, V8, P257, DOI 10.1007/BF00992697