Path planning for a statically stable biped robot using PRM and reinforcement learning

Cited by: 10

Authors:

Kulkarni, Prasad

Goswami, Dip

Guha, Prithwijit

Dutta, Ashish

Affiliations:

[1] Nagoya Univ, Dept Engn Sci & Mech, Chikusa Ku, Nagoya, Aichi, Japan

[2] Tata Motors, Tata Motors Res Div, Pune, Maharashtra, India

[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore

[4] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India

[5] Indian Inst Technol, Dept Mech Engn, Kanpur 208016, Uttar Pradesh, India

Source:

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2006 / Vol. 47 / No. 03

Keywords:

potential function; PRM; reinforcement learning; statically stable biped robot;

DOI:

10.1007/s10846-006-9071-3

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper discusses path planning and obstacle avoidance for a statically stable biped robot using PRM and reinforcement learning. The main objective is to compare these two path planning methods for applications involving a biped robot. The statically stable biped robot under consideration is a 4-degree-of-freedom walking robot that can follow any given trajectory on flat ground and has a fixed step length of 200 mm. It is shown that the first method produces the shortest smooth path, but it also increases the computational burden on the controller, as the robot has to turn at almost every step. The second method, however, produces paths composed of straight-line segments and hence requires less computation for trajectory following. Experiments were also conducted to demonstrate the effectiveness of the reinforcement learning based path planning method.

References

Pages: 197-214

Page count: 18

18 entries in total

[1] NUMERICAL POTENTIAL-FIELD TECHNIQUES FOR ROBOT PATH PLANNING [J].

BARRAQUAND, J ;

LANGLOIS, B ;

LATOMBE, JC .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (02) :224-241

[2] A random sampling scheme for path planning [J].

Barraquand, J ;

Kavraki, L ;

Latombe, JC ;

Motwani, R ;

Li, TY ;

Raghavan, P .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1997, 16 (06) :759-774

[3]

BEOM HR, 1995, IEEE T SYST MAN CYB, V25, P464, DOI 10.1109/21.364859

[4]

CLARK M, 2003, P IEEE INT C ROB AUT

[5]

Dulimarta H, 2002, PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, P3267, DOI 10.1109/WCICA.2002.1020138

[6]

GOSAVI A, TUTORIAL REINFORCEME

[7] A grasp-based motion planning algorithm for character animation [J].

Kalisiak, M ;

van de Panne, M .

JOURNAL OF VISUALIZATION AND COMPUTER ANIMATION, 2001, 12 (03) :117-129

[8] Probabilistic roadmaps for path planning in high-dimensional configuration spaces [J].

Kavraki, LE ;

Svestka, P ;

Latombe, JC ;

Overmars, MH .

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (04) :566-580

[9]

KUFFNER J, 2001, P IROS

[10] Dynamically-stable motion planning for humanoid robots [J].

Kuffner, JJ ;

Kagami, S ;

Nishiwaki, K ;

Inaba, M ;

Inoue, H .

AUTONOMOUS ROBOTS, 2002, 12 (01) :105-118
