共 14 条
- [1] Gao Y., Chen S., Lu X., Research on reinforcement learning technology: A review, Acta Automatica Sinica, 30, 1, pp. 86-100, (2004)
- [2] Zhang W., Lu T., Several crux problems of reinforcement learning application in robotics, Computer Engineering and Applications, 40, 4, (2004)
- [3] Skinner B.F., The Behavior of Organisms, (1938)
- [4] Wolf R., Heisenberg M., Basic organization of operant-behavior as revealed in drosophila flight orientation, Journal of Comparative Physiology A, 169, 6, pp. 699-705, (1991)
- [5] Rosen B.E., Goodwin J.M., Vidal J.J., Machine operantconditioning, Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 1500-1501, (1988)
- [6] Gaudiano P., Chang C., Adaptive obstacle avoidance with a neural network for operant conditioning: Experiments with real robots, IEEE International Symposium on Computational Intelligence in Robotics and Automation, pp. 13-18, (1997)
- [7] Zalama E., Gomez J., Paul M., Et al., Adaptive behavior navigation of a mobile robot, IEEE Transactions on Systems, Man, and Cybernetics, Part A - Systems and Humans, 32, 1, pp. 160-169, (2002)
- [8] Itoh K., Miwa H., Matsumoto M., Et al., Behavior model of humanoid robots based on operant conditioning, IEEE/RAS International Conference on Humanoid Robots, pp. 220-225, (2005)
- [9] Dominguez S., Zalama E., Garcia-Bermejo J.G., Et al., Lecture Notes in Computer Science, pp. 691-702, (2006)
- [10] Singh S.P., Sutton R.S., Reinforcement learning with replacing eligibility traces, Machine Learning, 22, 1-3, pp. 123-158, (1996)