共 61 条
Sustainable Life-Cycle Maintenance Policymaking for Network-Level Deteriorating Bridges with a Convolutional Autoencoder-Structured Reinforcement Learning Agent
被引:27
作者:
Lei, Xiaoming
[1
]
Dong, You
[1
]
Frangopol, Dan M.
[2
]
机构:
[1] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hung Hom, , HongKong, Hong Kong 999077, Peoples R China
[2] Lehigh Univ, Chair Struct Engn & Architecture, ATLSS Engn Res Ctr, Dept Civil & Environm Engn, 117 ATLSS Dr, Bethlehem, PA 18015 USA
关键词:
Bridge maintenance;
Infrastructure management;
Network-level bridges;
Sustainable assessment;
Carbon footprint;
Reinforcement learning;
Deep-Q network;
Convolutional autoencoder;
OPTIMIZATION;
UTILITY;
MODELS;
D O I:
10.1061/JBENF2.BEENG-6159
中图分类号:
TU [建筑科学];
学科分类号:
0813 ;
摘要:
Bridges play a significant role in urban areas, and their performance and safety are highly related to the carbon emissions of infrastructure systems. Previous studies have mainly offered maintenance policies that balance structural safety with overall costs. Considering the goal of achieving near-zero global carbon emissions by 2050, this study proposes a policymaking agent based on a convolutional autoencoder-structured deep-Q network (ConvAE-DQN) for managing deteriorating bridges at the network level while considering sustainability performance. This agent considers environmental, economic, and safety metrics, including spatially correlated structural failure probability, traffic volume, bridge size, and others, which are transformed into a multiattribute utility model to form the reward function. Reinforcement learning is employed to optimize the life-cycle maintenance planning to minimize the total carbon emissions and economic costs while maximizing regional safety performance. The proposed method is substantiated by developing sustainable life-cycle maintenance policies for an existing bridge network in Northern China. It is found that the proposed ConvAE-DQN policymaking agent could output efficient and sustainable life-cycle maintenance policies, which are annually stable and easy to schedule. The utility-based reward function enhances the stability and convergence efficiency of the algorithm. This study also assesses the impact of budget levels on network-level bridge safety and carbon footprint.
引用
收藏
页数:15
相关论文