Learning value functions with relational state representations for guiding task-and-motion planning

Cited by: 0

Authors:

Kim, Beomjoon [1]

Shimanuki, Luke [1]

Affiliations:

[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

Source:

CONFERENCE ON ROBOT LEARNING, VOL 100 | 2019 / Volume 100

Keywords:

Task and motion planning; value-function learning;

DOI:

Not available

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

We propose a novel relational state representation and an action-value function learning algorithm that learns from planning experience for geometric task-and-motion planning (GTAMP) problems, in which the goal is to move several objects to regions in the presence of movable obstacles. The representation encodes information about which objects occlude the manipulation of other objects and is encoded using a small set of predicates. It supports efficient learning, using graph neural networks, of an action-value function that can be used to guide a GTAMP solver. Importantly, it enables learning from planning experience on simple problems and generalizing to more complex problems and even across substantially different geometric environments. We demonstrate the method in two challenging GTAMP domains.


Pages: 14

