SPATIAL FACTOR MODELS FOR HIGH-DIMENSIONAL AND LARGE SPATIAL DATA: AN APPLICATION IN FOREST VARIABLE MAPPING

被引:25
|
作者
Taylor-Rodriguez, Daniel [1 ]
Finley, Andrew O. [2 ]
Datta, Abhirup [3 ]
Babcock, Chad [4 ]
Andersen, Hans-Erik [5 ]
Cook, Bruce D. [6 ]
Morton, Douglas C. [6 ]
Banerjee, Sudipto [7 ]
机构
[1] Portland State Univ, Dept Math & Stat, Portland, OR 97207 USA
[2] Michigan State Univ, Dept Forestry, E Lansing, MI 48824 USA
[3] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[4] Univ Washington, Sch Environm & Forest Sci, Seattle, WA 98195 USA
[5] USDA, Forest Serv, Pacific Northwest Res Stn, Seattle, WA USA
[6] NASA, Goddard Space Flight Ctr, Biospher Sci Lab, Greenbelt, MD USA
[7] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA USA
基金
美国国家科学基金会;
关键词
Forest outcomes; LiDAR data; nearest neighbor Gaussian processes; spatial prediction; GAUSSIAN PROCESS MODELS; ABOVEGROUND BIOMASS; HIERARCHICAL-MODELS; REGRESSION-MODELS; AIRBORNE; LIDAR; PREDICTION;
D O I
10.5705/ss.202018.0005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gathering information about forest variables is an expensive and arduous activity. Therefore, directly collecting the data required to produce high-resolution maps over large spatial domains is infeasible. Next-generation collection initiatives for remotely sensed light detection and ranging (LiDAR) data are specifically aimed at producing complete-coverage maps over large spatial domains. Given that Li-DAR data and forest characteristics are often strongly correlated, it is possible to use the former to model, predict, and map forest variables over regions of interest. This entails dealing with high-dimensional (similar to 10(2)) spatially dependent LiDAR outcomes over a large number of locations (similar to 10(5) - 10(6)). With this in mind, we develop the spatial factor nearest neighbor Gaussian process (SF-NNGP) model, which we embed in a two-stage approach that connects the spatial structure found in LiDAR signals with forest variables. We provide a simulation experiment that demonstrates the inferential and predictive performance of the SF-NNGP, and use the two-stage modeling strategy to generate complete-coverage maps of the forest variables, with associated uncertainty, over a large region of boreal forests in interior Alaska.
引用
收藏
页码:1155 / 1180
页数:26
相关论文
共 50 条
  • [41] A novel support vector classifier for longitudinal high-dimensional data and its application to neuroimaging data
    Chen S.
    Dubois Bowman F.
    Statistical Analysis and Data Mining, 2011, 4 (06): : 604 - 611
  • [42] Variable selection for high-dimensional genomic data with censored outcomes using group lasso prior
    Lee, Kyu Ha
    Chakraborty, Sounak
    Sun, Jianguo
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 112 : 1 - 13
  • [43] Digital mapping of sand, clay, and soil carbon by Random Forest models under different spatial resolutions
    Bhering, Silvio Barge
    Chagas, Cesar da Silva
    de Carvalho Junior, Waldir
    Pereira, Nilson Rendeiro
    Calderano Filho, Braz
    Koenow Pinheiro, Helena Saraiva
    PESQUISA AGROPECUARIA BRASILEIRA, 2016, 51 (09) : 1359 - 1370
  • [44] Spatial models for probabilistic prediction of wind power with application to annual-average and high temporal resolution data
    Lenzi, Amanda
    Pinson, Pierre
    Clemmensen, Line H.
    Guillot, Gilles
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2017, 31 (07) : 1615 - 1631
  • [45] PSEUDO-VALUE METHOD FOR ULTRA HIGH-DIMENSIONAL SEMIPARAMETRIC MODELS WITH LIFETIME DATA
    Sit, Tony
    Xing, Yue
    Xu, Yongze
    Gu, Minggao
    STATISTICA SINICA, 2019, 29 (04) : 1939 - 1961
  • [46] Sampling hyperparameters in hierarchical models: Improving on Gibbs for high-dimensional latent fields and large datasets
    Norton, Richard A.
    Christen, J. Andres
    Fox, Colin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (09) : 2639 - 2655
  • [47] Spatial models for probabilistic prediction of wind power with application to annual-average and high temporal resolution data
    Amanda Lenzi
    Pierre Pinson
    Line H. Clemmensen
    Gilles Guillot
    Stochastic Environmental Research and Risk Assessment, 2017, 31 : 1615 - 1631
  • [48] Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
    Thottakkara, Paul
    Ozrazgat-Baslanti, Tezcan
    Hupf, Bradley B.
    Rashidi, Parisa
    Pardalos, Panos
    Momcilovic, Petar
    Bihorac, Azra
    PLOS ONE, 2016, 11 (05):
  • [49] A random forest-based algorithm for data-intensive spatial interpolation in crop yield mapping
    Mariano, Cordoba
    Monica, Balzarini
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 184
  • [50] Mapping Forest Ecosystem Biomass Density for Xiangjiang River Basin by Combining Plot and Remote Sensing Data and Comparing Spatial Extrapolation Methods
    Zhu, Jia
    Huang, Zhihong
    Sun, Hua
    Wang, Guangxing
    REMOTE SENSING, 2017, 9 (03)