Detecting the impact area of BP deepwater horizon oil discharge: an analysis by time varying coefficient logistic models and boosted trees

被引:3
作者
Li, Tianxi [1 ]
Gao, Chao [2 ]
Xu, Meng [3 ]
Rajaratnam, Bala [1 ,4 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Yale Univ, Dept Stat, New Haven, CT USA
[3] Nankai Univ, Dept Environm Sci, Tianjin 300071, Peoples R China
[4] Stanford Univ, Dept Environm Earth Syst Sci, Stanford, CA 94305 USA
关键词
Varying-coefficients model; Logistic regression; Boosting trees; Oil impacts;
D O I
10.1007/s00180-013-0449-y
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Deepwater Horizon oil discharge in the Gulf of Mexico is considered to be one of the worst environmental disasters to date. The spread of the oil spill and its consequences thereof had various environmental impacts. The National Oceanic and Atmospheric Administration (NOAA) in conjunction with the Environmental Protection Agency (EPA), the US Fish and Wildlife Service, and the American Statistical Association (ASA) have made available a few datasets containing information of the oil spill. In this paper, we analyzed four of these datasets in order to explore the use of applied statistics and machine learning methods to understand the spread of the oil spill. In particular, we analysed the "gliders, floats, boats" and "birds" data. The former contains various measurements on sea water such as salinity, temperature, spacial locations, depth and time. The latter contains information on the living conditions of birds, such as living status, oil conditions, locations and time. A varying-coefficients logistic regression was fitted to the birds data. The result indicated that the oil was spreading more quickly along the East-West direction. Analysis via boosted trees and logistic regression showed similar results based on the information provided by the above data.
引用
收藏
页码:141 / 157
页数:17
相关论文
共 11 条
[1]  
[Anonymous], 2010, R LANG ENV STAT COMP
[2]  
[Anonymous], WILEY SERIES PROBABI
[3]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[4]   Stochastic gradient boosting [J].
Friedman, JH .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) :367-378
[5]  
Hastie T., 1990, Gam: Generalized Additive Models
[6]  
Hastie T., 2009, The elements of statistical learning: data mining, inference, and prediction, V2nd
[7]   Comparative performance of generalized additive models and multivariate adaptive regression splines for statistical modelling of species distributions [J].
Leathwick, J. R. ;
Elith, J. ;
Hastie, T. .
ECOLOGICAL MODELLING, 2006, 199 (02) :188-196
[8]  
Loecher M., 2011, RgoogleMaps: Overlays on Google Map Tiles in R
[9]  
RIDGEWAY G, 2010, GBM GEN BOOSTED REGR, P00001
[10]  
[No title captured]