Reinforcement learning for occupant behavior modeling in public buildings: Why, what and how?

被引:4
作者
Yu, Hao [1 ]
Xu, Xiaoxiao [1 ]
机构
[1] Nanjing Forestry Univ, Sch Civil Engn, Nanjing 210037, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Occupant behavior; Reinforcement learning; Public buildings; Energy consumption; Qualitative analysis; Abbreviations; ENERGY PERFORMANCE; HVAC; AGENT; OPTIMIZATION; CONSUMPTION; VENTILATION; FRAMEWORK; COMFORT; SYSTEMS; GAP;
D O I
10.1016/j.jobe.2024.110491
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Effective control of energy consumption in public buildings holds paramount significance for global sustainable development. However, uncertainty in occupant behavior during operational phase often leads to a substantial discrepancy between the designed and actual energy consumption. While there is existing research on occupant behavior in public buildings utilizing stochastic modeling, statistical modeling, data mining, and agent-based modeling methods, the connotations and formation mechanisms have not been systematically revealed, rendering these developed models weak in applicability and reliability in practical scenarios. Reinforcement learning, which can perceive aspects of external environment and learn from stochastic interactions spontaneously, has been distinguished as a promising method for occupant behavior modeling. This research aims to develop a paradigm for reinforcement learning application in occupant behavior modeling in public buildings. Semi-structured interviews and literature review were combined to collect relevant information, while reasons for using reinforcement learning and applicable algorithms were discerned through qualitative analysis. Finally, detailed frameworks were proposed and validated through focus groups, offering a comprehensive guide that will serve as a reference for future research. This includes steps for collecting occupant-related data, selecting appropriate algorithms, and training and deploying reinforcement learning agents effectively. This research innovatively proposed a "panorama" of reinforcement learning for occupant behavior modeling in public buildings, laying a solid foundation for indoor environment and energy efficiency improvement in public building operations.
引用
收藏
页数:20
相关论文
共 115 条
[1]   Improving residential building energy simulations through occupancy data derived from commercial off-the-shelf Wi-Fi sensing technology [J].
Abolhassani, Soroush Samareh ;
Zandifar, Azar ;
Ghourchian, Negar ;
Amayri, Manar ;
Bouguila, Nizar ;
Eicker, Ursula .
ENERGY AND BUILDINGS, 2022, 272
[2]   From Reinforcement Learning to Deep Reinforcement Learning: An Overview [J].
Agostinelli, Forest ;
Hocquet, Guillaume ;
Singh, Sameer ;
Baldi, Pierre .
BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 :298-328
[3]   Application of deep Q-networks for model-free optimal control balancing between different HVAC systems [J].
Ahn, Ki Uhn ;
Park, Cheol Soo .
SCIENCE AND TECHNOLOGY FOR THE BUILT ENVIRONMENT, 2020, 26 (01) :61-74
[4]   Government championed strategies to overcome the barriers to public building energy efficiency retrofit projects [J].
Alam, Morshed ;
Zou, Patrick X. W. ;
Stewart, Rodney A. ;
Bertone, Edoardo ;
Sahin, Oz ;
Buntine, Chris ;
Marshall, Carolyn .
SUSTAINABLE CITIES AND SOCIETY, 2019, 44 :56-69
[5]   Energy-efficient control of indoor PM2.5 and thermal comfort in a real room using deep reinforcement learning [J].
An, Yuting ;
Chen, Chun .
ENERGY AND BUILDINGS, 2023, 295
[6]   A reinforcement learning approach for control of window behavior to reduce indoor PM2.5 concentrations in naturally ventilated buildings [J].
An, Yuting ;
Xia, Tongling ;
You, Ruoyu ;
Lai, Dayi ;
Liu, Junjie ;
Chen, Chun .
BUILDING AND ENVIRONMENT, 2021, 200
[7]   Dynamic modeling of presence of occupants using inhomogeneous Markov chains [J].
Andersen, Philip Delff ;
Iversen, Anne ;
Madsen, Henrik ;
Rode, Carsten .
ENERGY AND BUILDINGS, 2014, 69 :213-223
[8]   Bridging Automated to Autonomous Cyber Defense: Foundational Analysis of Tabular Q-Learning [J].
Applebaum, Andy ;
Dennler, Camron ;
Dwyer, Patrick ;
Moskowitz, Marina ;
Nguyen, Harold ;
Nichols, Nicole ;
Park, Nicole ;
Rachwalski, Paul ;
Rau, Frank ;
Webster, Adrian ;
Wolk, Melody .
PROCEEDINGS OF THE 15TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2022, 2022, :149-159
[9]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[10]   Development of a ranking procedure for energy performance evaluation of buildings based on occupant behavior [J].
Ashouri, Milad ;
Haghighat, Fariborz ;
Fung, Benjamin C. M. ;
Yoshino, Hiroshi .
ENERGY AND BUILDINGS, 2019, 183 :659-671