A machine learning approach to evaluate the spatial variability of New York City's 311 street flooding complaints

被引:23
作者
Agonafir, Candace [1 ,2 ]
Lakhankar, Tarendra [2 ]
Khanbilvardi, Reza [1 ,2 ]
Krakauer, Nir [1 ,2 ]
Radell, Dave [3 ]
Devineni, Naresh [1 ,2 ]
机构
[1] CUNY City Coll, Dept Civil Engn, New York, NY 10031 USA
[2] NOAA, Ctr Earth Syst Sci & Remote Sensing Technol NOAA, New York, NY USA
[3] Natl Weather Serv, NOAA, US Dept Commerce, Upton, NY 11973 USA
关键词
Urban flooding; Random forest; Street flooding; Urban sewer system; Flood factors; RANDOM FOREST CLASSIFIER; LOW IMPACT DEVELOPMENT; URBAN; PREDICTION; REGRESSION; RESOURCES; HYDROLOGY; SYSTEM; MEDIA;
D O I
10.1016/j.compenvurbsys.2022.101854
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Urbanization, accompanied by the creation of roads, pavements, and sidewalks creates an environment where there is limited infiltration capacity, leaving metropolitan areas especially vulnerable during intense rain events. Furthermore, within an urban setting, there is spatial variability, as certain areas, owing to location, topography, land feature conditions, population and physical attributes or precipitation patterns, are more prone to flood damages. To detect neighborhoods with increased flood risk, crowdsourced data, which is the consolidation of eyewitness accounts, affords particular value. With an intent to understand how factors affect the spatial variability of street flooding, the Random Forest regression machine learning algorithm is employed, where the 311 street flooding reports of New York City (NYC) serve as the response, while the explanatory variables include topographic and land feature, physical and population dynamics, locational, infrastructural, and climatic influences. This study also analyzes socio-economic variables as predictors, as to allow for better insight into potential biases within the NYC 311 crowdsourced platform. It is found that catch basin complaints have overwhelmingly the greatest predictor importance, at 41%, almost sixfold higher than that of the second highest ranked predictor, slope, at 6.7%. Thus, NYC has an apparent issue with debris blocking the basins, and this may be remediated by increased cleaning efforts or public awareness to maintain clear streets, particularly during forecasted rain events. Furthermore, more than a third of the top predictors are land feature and topographical conditions, with building characteristics dominating the category. Often excluded in urban flood models, building effects, with a combined total importance of 11.7%, have greater significance than commonly considered flooding factors, such as percent impervious cover or elevation. Another major finding is the significance of the 'commuters who drive alone' variable, which alerts to the prospect of more reports being filed by those more affected by street flooding, as opposed to reflecting the actual occurrence of flooding (more reports being filed by those who drive on flooded roads versus those who do not). Overall, the leading contribution of this study is the identification of the top flooding factors in NYC, along with the presentation of their specific impacts towards street flooding variability among zip codes.
引用
收藏
页数:19
相关论文
共 79 条
[1]   Modelling flash flood propagation in urban areas using a two-dimensional numerical model [J].
Abderrezzak, Kamal El Kadi ;
Paquier, Andre ;
Mignot, Emmanuel .
NATURAL HAZARDS, 2009, 50 (03) :433-460
[2]   Understanding New York City street flooding through 311 complaints [J].
Agonafir, Candace ;
Pabon, Alejandra Ramirez ;
Lakhankar, Tarendra ;
Khanbilvardi, Reza ;
Devineni, Naresh .
JOURNAL OF HYDROLOGY, 2022, 605
[3]   Flooding in the Nechako River Basin of Canada: A random forest modeling approach to flood analysis in a regulated reservoir system [J].
Albers, Sam J. ;
Dery, Stephen J. ;
Petticrew, Ellen L. .
CANADIAN WATER RESOURCES JOURNAL, 2016, 41 (1-2) :250-260
[4]  
Ali J., 2012, INT J COMPUT SCI ISS, V9, P272
[5]   Demystifying uncertainty in PM10 susceptibility mapping using variable drop-off in extreme-gradient boosting (XGB) and random forest (RF) algorithms [J].
AlThuwaynee, Omar F. ;
Kim, Sang-Wan ;
Najemaden, Mohamed A. ;
Aydda, Ali ;
Balogun, Abdul-Lateef ;
Fayyadh, Moatasem M. ;
Park, Hyuck-Jin .
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (32) :43544-43566
[6]  
[Anonymous], 2019, INS OUTS NYC COMMUTI
[7]  
[Anonymous], 2022, STREET FLOODING
[8]  
[Anonymous], 2022, GREEN ROOFS SOLAR PA
[9]  
[Anonymous], 2022, INFO BRIEF FLOOD RIS
[10]  
[Anonymous], 2022, FLOOD PREVENTION