共 41 条
- [1] Arnold L., Auger A., Hansen N., Ollivier Y., Informationgeometric Optimization Algorithms: A Unifying Picture Via Invariance Principles, (2011)
- [2] Barto A., Mahadevan S., Recent advances in hierarchical reinforcement learning, Discrete Event Systems, 13, 1-2, pp. 41-77, (2003)
- [3] Beyer H.-G., Schwefel H.-P., Evolution strategies-a comprehensive introduction, Natural Computing, 1, 1, pp. 3-52, (2002)
- [4] Busoniu L., Ernst D., De Schutter B., Babuska R., Crossentropy optimization of control policies with adaptive basis functions, IEEE Transactions on Systems, Man, andCybernetics-Part B: Cybernetics, 41, 1, pp. 196-209, (2011)
- [5] Gomez F., Schmidhuber J., Miikkulainen R., Accelerated neural evolution through cooperatively coevolved synapses, Journalof Machine Learning Research, 9, pp. 937-965, (2008)
- [6] Hansen N., Ostermeier A., Completely derandomized selfadaptation in evolution strategies, Evolutionary Computation, 9, 2, pp. 159-195, (2001)
- [7] Hansen N., The CMA Evolution Strategy: A Tutorial, (2011)
- [8] Heidrich-Meisner V., Igel C., Evolution strategies for direct policy search, Proceedings of the 10th interna-tional conference on Parallel Problem Solving from Nature:PPSN X, pp. 428-437, (2008)
- [9] Heidrich-Meisner V., Igel C., Similarities and differences between policy gradient methods and evolution strategies, ESANN 2008, 16th European Symposium on Artifi-cial Neural Networks, Bruges, Belgium, April 23-25, 2008,Proceedings, pp. 149-154, (2008)
- [10] Ijspeert A., Nakanishi J., Pastor P., Hoffmann H., Schaal S., Dynamical Movement Primitives: Learning attractor models for motor behaviors, Neural Computation, 25, 2, pp. 328-373, (2013)