Design of an Active Vision System for High-Level Isolation Units through Q-Learning

被引:3
|
作者
Ruiz, Andrea Gil [1 ,2 ]
Victores, Juan G. [1 ]
Lukawski, Bartek [1 ]
Balaguer, Carlos [1 ]
机构
[1] Univ Carlos III Madrid, RoboticsLab Res Grp, Leganes 28911, Spain
[2] Av Univ 30, Madrid 28911, Spain
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 17期
关键词
reinforcement learning; personal protective equipment; Q-Learning; reward shaping; grid search; healthcare; infectious diseases; Filoviridae viruses; coronavirus;
D O I
10.3390/app10175927
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The inspection of Personal Protective Equipment (PPE) is one of the most necessary measures when treating patients affected by infectious diseases, such as Ebola or COVID-19. Assuring the integrity of health personnel in contact with infected patients has become an important concern in developed countries. This work focuses on the study of Reinforcement Learning (RL) techniques for controlling a scanner prototype in the presence of blood traces on the PPE that could arise after contact with pathological patients. A preliminary study on the design of an agent-environment system able to simulate the required task is presented. The task has been adapted to an environment for the OpenAI Gym toolkit. The evaluation of the agent's performance has considered the effects of different topological designs and tuning hyperparameters of the Q-Learning model-free algorithm. Results have been evaluated on the basis of average reward and timesteps per episode. The sample-average method applied to the learning rate parameter, as well as a specific epsilon decaying method worked best for the trained agents. The obtained results report promising outcomes of an inspection system able to center and magnify contaminants in the real scanner system.
引用
收藏
页数:15
相关论文
共 12 条
  • [1] High-Level Path Planning for an Autonomous Sailboat Robot Using Q-Learning
    da Silva Junior, Andouglas Goncalves
    dos Santos, Davi Henrique
    Fernandes de Negreiros, Alvaro Pinto
    Boas de Souza Silva, Joao Moreno Vilas
    Garcia Goncalves, Luiz Marcos
    SENSORS, 2020, 20 (06)
  • [2] High-level Tracking of Autonomous Underwater Vehicles Based on Pseudo Averaged Q-learning
    Shi, Wenjie
    Song, Shiji
    Wu, Cheng
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 4138 - 4143
  • [3] Personnel Management and Biosecurity of US High-Level Isolation Units
    Herstein, Jocelyn J.
    Biddinger, Paul D.
    Gibbs, Shawn G.
    Le, Aurora B.
    Jelden, Katelyn C.
    Hewlett, Angela L.
    Lowe, John J.
    JOURNAL OF NURSING ADMINISTRATION, 2018, 48 (11): : 553 - 560
  • [4] Design and Testing of a Demand Response Q-Learning Algorithm for a Smart Home Energy Management System
    Angano, Walter
    Musau, Peter
    Wekesa, Cyrus Wabuge
    2021 IEEE PES/IAS POWERAFRICA CONFERENCE, 2021, : 328 - 332
  • [5] Global High-Consequence Infectious Disease Readiness and Response: An Inventory of High-Level Isolation Units
    Stern, Katie L.
    Sauer, Lauren M.
    Arguinchona, Christa
    Dunning, Jake
    Elrayes, Wael
    Lim, Poh Lian
    Vasoo, Shawn
    Herstein, Jocelyn J.
    HEALTH SECURITY, 2024, : 422 - 428
  • [6] Learning high-level robotic soccer strategies from scratch through reinforcement learning
    Abreu, Miguel
    Reis, Luis Paulo
    Cardoso, Henrique Lopes
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019, : 128 - 134
  • [7] Design and Applications of Q-Learning Adaptive PID Algorithm for Maglev Train Levitation Control System
    Shou, Baineng
    Zhang, Hehong
    Long, Zhiqiang
    Xie, Yunde
    Zhang, Ke
    Gu, Qiuming
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1947 - 1953
  • [8] Energy and Quality Aware Multi-UAV Flight Path Design Through Q-Learning Algorithms
    Zouaoui, Hend
    Faricelli, Simone
    Cuomo, Francesca
    Colonnese, Stefania
    Chiaraviglio, Luca
    WIRED/WIRELESS INTERNET COMMUNICATIONS, WWIC 2019, 2019, 11618 : 246 - 257
  • [9] Dispatching Algorithm Design for Elevator Group Control System with Q-Learning based on a Recurrent Neural Network
    Liu, Weipeng
    Liu, Ning
    Sun, Hexu
    Xing, Guansheng
    Dong, Yan
    Chen, Haiyong
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3397 - 3402
  • [10] An Intelligent Control System Construction Using High-level Time Petri Net And Reinforcement Learning
    Feng, Liangbing
    Obayashi, Masanao
    Kuremoto, Takashi
    Kobayashi, Kunikazu
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 535 - 539