Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard

被引：20

作者：

Church, Alex ^{[1
,2
]}

Lloyd, John ^{[1
,2
]}

Hadsell, Raia ^{[3
]}

Lepora, Nathan F. ^{[1
,2
]}

机构：

[1] Univ Bristol, Dept Engn Math, Bristol BS8 1UB, Avon, England

[2] Univ Bristol, Bristol Robot Lab, Bristol BS8 1UB, Avon, England

[3] Google DeepMind, London N1C 4AG, England

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2020年 / 5卷 / 04期

基金：

英国工程与自然科学研究理事会;

关键词：

Task analysis; Tactile sensors; Keyboards; Training; Force and tactile sensing; reinforecment learning; biomimetics;

D O I：

10.1109/LRA.2020.3010461

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Artificial touch would seem well-suited for Reinforcement Learning (RL), since both paradigms rely on interaction with an environment. Here we propose a new environment and set of tasks to encourage development of tactile reinforcement learning: learning to type on a braille keyboard. Four tasks are proposed, progressing in difficulty from arrow to alphabet keys and from discrete to continuous actions. A simulated counterpart is also constructed by sampling tactile data from the physical environment. Using state-of-the-art deep RL algorithms, we show that all of these tasks can be successfully learnt in simulation, and 3 out of 4 tasks can be learned on the real robot. A lack of sample efficiency currently makes the continuous alphabet task impractical on the robot. To the best of our knowledge, this work presents the first demonstration of successfully training deep RL agents in the real world using observations that exclusively consist of tactile images. To aid future research utilising this environment, the code for this project has been released along with designs of the braille keycaps for 3D printing and a guide for recreating the experiments.

引用

页码：6145 / 6152

页数：8

共 40 条

[1]

Ahn M., 2019, P MACHINE LEARNING R, V100, P1300

[2] Learning dexterous in-hand manipulation [J].

Andrychowicz, Marcin ;

Baker, Bowen ;

Chociej, Maciek ;

Jozefowicz, Rafal ;

McGrew, Bob ;

Pachocki, Jakub ;

Petron, Arthur ;

Plappert, Matthias ;

Powell, Glenn ;

Ray, Alex ;

Schneider, Jonas ;

Sidor, Szymon ;

Tobin, Josh ;

Welinder, Peter ;

Weng, Lilian ;

Zaremba, Wojciech .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20

[3]

[Anonymous], 2018, P 6 INT C LEARN REPR

[4]

[Anonymous], 2016, P 4 INT C LEARN REPR

[5]

[Anonymous], 2017, ARXIV170403073

[6]

[Anonymous], 2016, P 4 INT C LEARN REPR

[7]

[Anonymous], 2018, ARXIV180209464

[8] The Arcade Learning Environment: An Evaluation Platform for General Agents [J].

Bellemare, Marc G. ;

Naddaf, Yavar ;

Veness, Joel ;

Bowling, Michael .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279

[9]

Brockman Greg, 2016, arXiv

[10]

Calandra R., 2017, ANN C ROBOT LEARNING, P314

← 1 2 3 4 →