Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning

Cited by: 28
Authors
Behrens, Michael R. [1]
Ruder, Warren C. [1, 2]
Affiliations
[1] Univ Pittsburgh, Dept Bioengn, 300 Technol Dr, Pittsburgh, PA 15219 USA
[2] Carnegie Mellon Univ, Dept Mech Engn, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
artificial intelligence; control systems; machine learning; magnetics; microrobot; reinforcement learning; robotics; BEHAVIOR; DESIGN; ROBOT;
DOI
10.1002/aisy.202200023
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Swimming microrobots are increasingly developed with complex materials and dynamic shapes and are expected to operate in complex environments in which the system dynamics are difficult to model and positional control of the microrobot is not straightforward to achieve. Deep reinforcement learning is a promising method of autonomously developing robust controllers for creating smart microrobots, which can adapt their behavior to operate in uncharacterized environments without the need to model the system dynamics. This article reports the development of a smart helical magnetic hydrogel microrobot that uses the soft actor-critic reinforcement learning algorithm to autonomously derive a control policy that allows the microrobot to swim through an uncharacterized biomimetic fluidic environment under control of a time-varying magnetic field generated from a three-axis array of electromagnets. The reinforcement learning agent learns successful control policies from both state vector input and raw images, and the control policies learned by the agent recapitulate the behavior of rationally designed controllers based on physical models of helical swimming microrobots. Deep reinforcement learning applied to microrobot control is likely to significantly expand the capabilities of the next generation of microrobots.
Pages: 16
Related papers
50 in total
  • [1] Deep learning, reinforcement learning, and world models
    Matsuo, Yutaka
    LeCun, Yann
    Sahani, Maneesh
    Precup, Doina
    Silver, David
    Sugiyama, Masashi
    Uchibe, Eiji
    Morimoto, Jun
    NEURAL NETWORKS, 2022, 152 : 267 - 275
  • [2] Ultrasound Microrobots with Reinforcement Learning
    Schrage, Matthijs
    Medany, Mahmoud
    Ahmed, Daniel
    ADVANCED MATERIALS TECHNOLOGIES, 2023, 8 (10)
  • [3] Deep reinforcement learning in smart manufacturing: A review and prospects
    Li, Chengxi
    Zheng, Pai
    Yin, Yue
    Wang, Baicun
    Wang, Lihui
    CIRP JOURNAL OF MANUFACTURING SCIENCE AND TECHNOLOGY, 2023, 40 : 75 - 101
  • [4] Hierarchical Planning with Deep Reinforcement Learning for 3D Navigation of Microrobots in Blood Vessels
    Yang, Yuguang
    Bevan, Michael A.
    Li, Bo
    ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (11)
  • [5] Deep Reinforcement Learning-Based Collision-Free Navigation for Magnetic Helical Microrobots in Dynamic Environments
    Wang, Huaping
    Qiu, Yukang
    Hou, Yaozhen
    Shi, Qing
    Huang, Hen-Wei
    Huang, Qiang
    Fukuda, Toshio
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024
  • [6] A Review of Deep Reinforcement Learning for Smart Building Energy Management
    Yu, Liang
    Qin, Shuqi
    Zhang, Meng
    Shen, Chao
    Jiang, Tao
    Guan, Xiaohong
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15) : 12046 - 12063
  • [7] Predictive Control of a Robot Manipulator with Deep Reinforcement Learning
    Bejar, Eduardo
    Moran, Antonio
    2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021 : 127 - 130
  • [8] Deep reinforcement learning for conservation decisions
    Lapeyrolerie, Marcus
    Chapman, Melissa S.
    Norman, Kari E. A.
    Boettiger, Carl
    METHODS IN ECOLOGY AND EVOLUTION, 2022, 13 (11) : 2649 - 2662
  • [9] Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles
    Aradi, Szilard
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 740 - 759
  • [10] Deep Reinforcement Learning for Smart City Communication Networks
    Xia, Zhenchang
    Xue, Shan
    Wu, Jia
    Chen, Yanjiao
    Chen, Junjie
    Wu, Libing
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (06) : 4188 - 4196