10 entries in total
- [1] Boutilier C, 1999, P IJCAI 99 STOCKH SW
- [2] Watkins C. J. C. H., 1989, Learning from delayed rewards
- [3] Coello C. A. C., 1999, Knowledge and Information Systems, V1, P269
- [4] Deb K, 1998, CI4998 TR U DORTM DE
- [5] Fonseca C. M., 1995, P 1 INT C GEN ALG EN, P45
- [6] Lamont G. B., 1999, P 1999 ACM S APPL CO, P351, DOI 10.1145/298151.298382
- [7] Littman M.L., 1994, MACHINE LEARNING P 1, P157, DOI 10.1016/B978-1-55860-335-6.50027-1
- [8] Mariano C, 2000, LECT NOTES ARTIF INT, V1793, P212
- [9] Tan M, 1993, P 10 INT C MACHINE L, P330, DOI 10.1016/B978-1-55860-307-3.50049-6
- [10] Viennet R, 1996, INT J SYST SCI, V27, P255, DOI 10.1080/00207729608929211