Distributed model building and recursive integration for big spatial data modeling

被引:0
|
作者
Hector, Emily C. [1 ]
Reich, Brian J. [1 ]
Eloyan, Ani [2 ]
机构
[1] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
[2] Brown Univ, Dept Biostat, Providence, RI 02912 USA
基金
美国国家科学基金会;
关键词
divide-and-conquer; functional connectivity; generalized method of moments; nearest-neighbor Gaussian process; optimal estimating functions; SAMPLE PROPERTIES; GAUSSIAN PROCESS; LIKELIHOOD;
D O I
10.1093/biomtc/ujae159
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivated by the need for computationally tractable spatial methods in neuroimaging studies, we develop a distributed and integrated framework for estimation and inference of Gaussian process model parameters with ultra-high-dimensional likelihoods. We propose a shift in viewpoint from whole to local data perspectives that is rooted in distributed model building and integrated estimation and inference. The framework's backbone is a computationally and statistically efficient integration procedure that simultaneously incorporates dependence within and between spatial resolutions in a recursively partitioned spatial domain. Statistical and computational properties of our distributed approach are investigated theoretically and in simulations. The proposed approach is used to extract new insights into autism spectrum disorder from the autism brain imaging data exchange.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A memory-free spatial additive mixed modeling for big spatial data
    Daisuke Murakami
    Daniel A. Griffith
    Japanese Journal of Statistics and Data Science, 2020, 3 : 215 - 241
  • [32] Security Model for Distributed GIS Spatial Data
    Zhang, Yanqun
    Wang, Qianping
    ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 2, 2008, : 641 - 645
  • [33] A model for the spatial and temporal integration of geolocated data
    Arenas, Helbert
    Trojahn, Cassia
    Comparot, Catherine
    Aussenac-Gilles, Nathalie
    REVUE INTERNATIONALE DE GEOMATIQUE, 2018, 28 (02): : 243 - 266
  • [34] Random Sample Partition: A Distributed Data Model for Big Data Analysis
    Salloum, Salman
    Huan, Joshua Zhexue
    He, Yulin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (11) : 5846 - 5854
  • [35] Big Data Integration
    Dong, Xin Luna
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11): : 1188 - 1189
  • [36] Big Data Integration
    Dong, Xin Luna
    Srivastava, Divesh
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1245 - 1248
  • [37] Big Data Integration
    Cudre-Mauroux, Philippe
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS CONTEL 2017, 2017, : 5 - 5
  • [38] SparkNN: A distributed in-memory data partitioning for KNN queries on big spatial data
    Al Aghbari Z.
    Ismail T.
    Kamel I.
    Data Science Journal, 2020, 19 (01) : 1 - 14
  • [39] Uncertainty in the modeling of spatial big data on a pattern of bushfires holes
    Stein, Alfred
    Tolpekin, Valentyn A.
    NATURAL RESOURCE MODELING, 2018, 31 (03)
  • [40] A data model-independent approach to big research data integration
    Bartalesi V.
    Meghini C.
    Thanos C.
    International Journal of Metadata, Semantics and Ontologies, 2019, 13 (04): : 330 - 345