Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning

被引:35
作者
Behrens, Michael R. [1 ]
Ruder, Warren C. [1 ,2 ]
机构
[1] Univ Pittsburgh, Dept Bioengn, 300 Technol Dr, Pittsburgh, PA 15219 USA
[2] Carnegie Mellon Univ, Dept Mech Engn, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
artificial intelligence; control systems; machine learning; magnetics; microrobot; reinforcement learning; robotics; BEHAVIOR; DESIGN; ROBOT;
D O I
10.1002/aisy.202200023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Swimming microrobots are increasingly developed with complex materials and dynamic shapes and are expected to operate in complex environments in which the system dynamics are difficult to model and positional control of the microrobot is not straightforward to achieve. Deep reinforcement learning is a promising method of autonomously developing robust controllers for creating smart microrobots, which can adapt their behavior to operate in uncharacterized environments without the need to model the system dynamics. This article reports the development of a smart helical magnetic hydrogel microrobot that uses the soft actor critic reinforcement learning algorithm to autonomously derive a control policy which allows the microrobot to swim through an uncharacterized biomimetic fluidic environment under control of a time-varying magnetic field generated from a three-axis array of electromagnets. The reinforcement learning agent learns successful control policies from both state vector input and raw images, and the control policies learned by the agent recapitulate the behavior of rationally designed controllers based on physical models of helical swimming microrobots. Deep reinforcement learning applied to microrobot control is likely to significantly expand the capabilities of the next generation of microrobots.
引用
收藏
页数:16
相关论文
共 82 条
[1]   How Should Microrobots Swim? [J].
Abbott, Jake J. ;
Peyer, Kathrin E. ;
Lagomarsino, Marco Cosentino ;
Zhang, Li ;
Dong, Lixin ;
Kaliakatsos, Ioannis K. ;
Nelson, Bradley J. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2009, 28 (11-12) :1434-1447
[2]   Multifunctional surface microrollers for targeted cargo delivery in physiological blood flow [J].
Alapan, Yunus ;
Bozuyuk, Ugur ;
Erkoc, Pelin ;
Karacakol, Alp Can ;
Sitti, Metin .
SCIENCE ROBOTICS, 2020, 5 (42)
[3]   Soft erythrocyte-based bacterial microswimmers for cargo delivery [J].
Alapan, Yunus ;
Yasa, Oncay ;
Schauer, Oliver ;
Giltinan, Joshua ;
Tabak, Ahmet F. ;
Sourjik, Victor ;
Sitti, Metin .
SCIENCE ROBOTICS, 2018, 3 (17)
[4]  
Amin, IN VITRO VALIDATION
[5]  
Andrychowicz M., 2017, ADV NEURAL INF PROCE, V30
[6]   An elegant mind: Learning and memory in Caenorhabditis elegans [J].
Ardiel, Evan L. ;
Rankin, Catharine H. .
LEARNING & MEMORY, 2010, 17 (04) :191-201
[7]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[8]   Medical Imaging of Microrobots: Toward In Vivo Applications [J].
Aziz, Azaam ;
Pane, Stefano ;
Iacovacci, Veronica ;
Koukourakis, Nektarios ;
Czarske, Juergen ;
Menciassi, Arianna ;
Medina-Sanchez, Mariana ;
Schmidt, Oliver G. .
ACS NANO, 2020, 14 (09) :10865-10893
[9]   Adaptive immunity [J].
Bonilla, Francisco A. ;
Oettgen, Hans C. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2010, 125 (02) :S33-S40
[10]   Self-Folding Thermo-Magnetically Responsive Soft Microgrippers [J].
Breger, Joyce C. ;
Yoon, ChangKyu ;
Xiao, Rui ;
Kwag, Hye Rin ;
Wang, Martha O. ;
Fisher, John P. ;
Nguyen, Thao D. ;
Gracias, David H. .
ACS APPLIED MATERIALS & INTERFACES, 2015, 7 (05) :3398-3405