Close collaboration and desired strategy is indispensable for humanoid robots in the RoboCup soccer competition. In order to solve the problem that the convergence rate is too low in training local strategies,this paper mainly proposed a method to optimize the parameters in decision and positioning based on reinforcement learning for soccer robots. First, Markov decision process is applied to the framework for reinforcement learning. Then,we propose a relative improved method, which is known as a Sarsa Algorithm to overcome the drawback of the low convergence rate of the average reward reinforcement learning. Meanwhile, in order to deal with the large state space problems arising in the training and improve the generalization ability, this method is applied to the Keepaway local training. The training results show that, this algorithm has a faster convergent speed than other ordinary learning algorithm.
机构:
Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Liu, Bo
Zhang, Yongping
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Zhang, Yongping
Sun, Hanlin
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sino French Engineer Sch, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Sun, Hanlin
Sheng, Guojun
论文数: 0引用数: 0
h-index: 0
机构:
COSMO Ind Intelligence Res Inst Co Ltd, Qingdao 266426, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Sheng, Guojun
Zou, Xiaofu
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sch Artificial Intelligence, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Zou, Xiaofu
Cheng, Ying
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Cheng, Ying
Tao, Fei
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Beihang Univ, Int Res Inst Multidisciplinary Sci, Digital Twin Int Res Ctr, Beijing 100191, Peoples R ChinaBeihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China