共 12 条
- [1] Blackwell D.(1956)An analog of the minimax theorem for vector payoffs Pacific Journal of Mathematic 6 1-8
- [2] Chernoff H.(1952)A measure of the asymptotic efficiency for tests of a hypothesis based on the sum of observations Annals of Mathematical Statistics 23 493-509
- [3] Harsanyi J.C.(1967)Games with incomplete information played by bayesian players, Parts i, ii, iii Management Science 14 159-182
- [4] Kaelbling L.P.(1996)Reinforcement learning: A survey Journal of Artificial Intelligence Research 4 237-258
- [5] Littman M.L.(1997)Dynamic non-Bayesian decision-making Journal of Artificial Intelligence Research 7 231-248
- [6] Moore A.W.(1995)Multi-entity models Machine Intelligence 14 63-88
- [7] Monderer D.(1953)Stochastic games Proc. Nat. Acad. Sci. U.S.A. 39 1095-1100
- [8] Tennenholtz M.(1984)A theory of the learnable Comm. ACM 27 1134-1142
- [9] Moses Y.(undefined)undefined undefined undefined undefined-undefined
- [10] Tennenholtz M.(undefined)undefined undefined undefined undefined-undefined