Data-Driven Control of COVID-19 in Buildings: A Reinforcement-Learning Approach

被引:3
作者
Hosseinloo, Ashkan Haji [1 ]
Nabi, Saleh [2 ]
Hosoi, Anette [3 ]
Dahleh, Munther A. [1 ]
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[3] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
关键词
Disease control; reinforcement learning; data-driven control; HVAC system; AIRBORNE TRANSMISSION; VENTILATION;
D O I
10.1109/TASE.2023.3315549
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In addition to its public health crisis, COVID-19 pandemic has led to the shutdown and closure of workplaces with an estimated total cost of more than $16 trillion. Given the long hours an average person spends in buildings and indoor environments, this research article proposes data-driven control strategies to design optimal indoor airflow to minimize the exposure of occupants to viral pathogens in built environments. A general control framework is put forward for designing an optimal velocity field and proximal policy optimization, a reinforcement learning algorithm is employed to solve the control problem in a data-driven fashion. The same framework is used for optimal placement of disinfectants to neutralize the viral pathogens as an alternative to the airflow design when the latter is practically infeasible or hard to implement. We show, via computational simulations, that the control agent learns the optimal policy in both scenarios within a reasonable time. The proposed data-driven control framework in this study will have significant societal and economic benefits by setting the foundation for an improved methodology in designing case-specific infection control guidelines that can be realized by affordable ventilation devices and disinfectants.Note to Practitioners-This paper is motivated by the problem of COVID-19 infection spread in enclosed spaces but it also applies to other airborne pathogens. Airborne disease contagion often takes place in indoor environments; however, ventilation systems are almost never designed to take this into account so as to contain the spread of the pathogens. This is mainly because airflow design requires solving high-dimensional nonlinear partial differential equations known as Navier Stokes equations in fluid dynamics. In this paper, we propose a data-driven approach for solving the control problem of pathogen containment without solving the fluid dynamics equations. To this end, we first mathematically formulate the problem as an optimal control problem and then cast it as a reinforcement learning (RL) task. Reinforcement learning is the data-driven science of sequential decision-making and control in which the controller finds an optimal solution by systematic trial and error and without access to the system dynamics, i.e. fluid and pathogen dynamics in this paper. We employ an state-of-the-art RL algorithm, called PPO, to solve for optimal airflow in a room so as to minimize the exposure risk of occupants. Once it is calculated, the optimal airflow could be realized, via reverse engineering, by proper placement of the ventilation equipment, e.g. inlets, outlets, and fans. As an alternative to the airflow design, we use the same proposed data-driven techniques to find an optimal placement for pathogen disinfectants if there exists one, such as, hydrogen peroxide for COVID-19. Our results show the efficacy of our data-driven approach in designing an steady-state controller with full access to the system states. In future research, we will address the controller design with sparse measurements of the system states.
引用
收藏
页码:5691 / 5699
页数:9
相关论文
共 45 条
  • [1] Aliabadi Amir A, 2011, Adv Prev Med, V2011, P124064, DOI 10.4061/2011/124064
  • [2] Alnaes M., 2015, ARCH NUMER SOFTW, V3, P9, DOI DOI 10.11588/ANS.2015.100.20553
  • [3] Host-to-host airborne transmission as a multiphase flow problem for science-based social distance guidelines
    Balachandar, S.
    Zaleski, S.
    Soldati, A.
    Ahmadi, G.
    Bourouiba, L.
    [J]. INTERNATIONAL JOURNAL OF MULTIPHASE FLOW, 2020, 132
  • [4] Turbulent Gas Clouds and Respiratory Pathogen Emissions Potential Implications for Reducing Transmission of COVID-19
    Bourouiba, Lydia
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 323 (18): : 1837 - 1838
  • [5] Predictive and retrospective modelling of airborne infection risk using monitored carbon dioxide
    Burridge, Henry C.
    Fan, Shiwei
    Jones, Roderic L.
    Noakes, Catherine J.
    Linden, P. F.
    [J]. INDOOR AND BUILT ENVIRONMENT, 2022, 31 (05) : 1363 - 1380
  • [6] Chen GY, 1998, ENERG BUILDINGS, V28, P137
  • [7] Disinfection of N95 respirators by ionized hydrogen peroxide during pandemic coronavirus disease 2019 (COVID-19) due to SARS-CoV-2
    Cheng, V. C. C.
    Wong, S-C.
    Kwan, G. S. W.
    Hui, W-T.
    Yuen, K-Y.
    [J]. JOURNAL OF HOSPITAL INFECTION, 2020, 105 (02) : 358 - 359
  • [8] HVAC systems for environmental control to minimize the COVID-19 infection
    Ding, Junwei
    Yu, Chuck Wah
    Cao, Shi-Jie
    [J]. INDOOR AND BUILT ENVIRONMENT, 2020, 29 (09) : 1195 - 1201
  • [9] Reinforcement learning for bluff body active flow control in experiments and simulations
    Fan, Dixia
    Yang, Liu
    Wang, Zhicheng
    Triantafyllou, Michael S.
    Karniadakis, George Em
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (42) : 26091 - 26098
  • [10] Farahmand AM, 2017, P AMER CONTR CONF, P3120, DOI 10.23919/ACC.2017.7963427