Modular Robot Design Synthesis with Deep Reinforcement Learning

被引:0
作者
Whitman, Julian [1 ]
Bhirangi, Raunaq [2 ]
Travers, Matthew [2 ]
Choset, Howie [2 ]
机构
[1] Carnegie Mellon Univ, Dept Mech Engn, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Robot Inst, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
来源
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modular robots hold the promise of versatility in that their components can be re-arranged to adapt the robot design to a task at deployment time. Even for the simplest designs, determining the optimal design is exponentially complex due to the number of permutations of ways the modules can be connected. Further, when selecting the design for a given task, there is an additional computational burden in evaluating the capability of each robot, e.g., whether it can reach certain points in the workspace. This work uses deep reinforcement learning to create a search heuristic that allows us to efficiently search the space of modular serial manipulator designs. We show that our algorithm is more computationally efficient in determining robot designs for given tasks in comparison to the current state-of-the-art.
引用
收藏
页码:10418 / 10425
页数:8
相关论文
共 22 条
[1]   Effortless creation of safe robots from modules through self-programming and self-verification [J].
Althoff, M. ;
Giusti, A. ;
Liu, S. B. ;
Pereira, A. .
SCIENCE ROBOTICS, 2019, 4 (31)
[2]  
[Anonymous], 2017, NEURAL INFORM PROCES
[3]  
[Anonymous], 2018, ARXIV180101432
[4]   LEARNING TO ACT USING REAL-TIME DYNAMIC-PROGRAMMING [J].
BARTO, AG ;
BRADTKE, SJ ;
SINGH, SP .
ARTIFICIAL INTELLIGENCE, 1995, 72 (1-2) :81-138
[5]  
Bengio Y, 2009, INT C MACHINE LEARNI, P41, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.1553380]
[6]  
Bhardwaj M., 2017, C ROBOT LEARNING, P271
[7]  
Chen I.M., 1996, P 4 INT C CONTR AUT
[8]  
Chen T., 2018, ADV NEURAL INFORM PR, P9333
[9]  
Desai Ruta, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1196, DOI 10.1109/ICRA.2017.7989143
[10]  
Desai R., 2018, ARXIV180607419