This paper presents a unified framework for supervised learning and inference procedures using the divide-and-conquer approach for high-dimensional correlated outcomes. We propose a general class of estimators that can be implemented in a fully distributed and parallelized computational scheme. Modeling, computational and theoretical challenges related to high-dimensional correlated outcomes are overcome by dividing data at both outcome and subject levels, estimating the parameter of interest from blocks of data using a broad class of supervised learning procedures, and combining block estimators in a closed-form meta-estimator asymptotically equivalent to estimates obtained by Hansen (1982)'s generalized method of moments (GMM) that does not require the entire data to be reloaded on a common server. We provide rigorous theoretical justifications for the use of distributed estimators with correlated outcomes by studying the asymptotic behaviour of the combined estimator with fixed and diverging number of data divisions. Simulations illustrate the finite sample performance of the proposed method, and we provide an R package for ease of implementation.
机构:
Univ Calif San Diego, Dept Math, San Diego, CA 92093 USA
Univ Calif San Diego, Halicioglu Data Sci Inst, San Diego, CA 92093 USAUniv Calif San Diego, Dept Math, San Diego, CA 92093 USA
Bradic, Jelena
Ji, Weijie
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R ChinaUniv Calif San Diego, Dept Math, San Diego, CA 92093 USA
Ji, Weijie
Zhang, Yuqian
论文数: 0引用数: 0
h-index: 0
机构:
Renmin Univ China, Inst Stat & Big Data, Beijing, Peoples R ChinaUniv Calif San Diego, Dept Math, San Diego, CA 92093 USA
机构:
Nankai Univ, Sch Stat & Data Sci, LPMC & KLMDASR, Tianjin, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, LPMC & KLMDASR, Tianjin, Peoples R China
Han, Dongxiao
Han, Miao
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, LPMC & KLMDASR, Tianjin, Peoples R China
Han, Miao
Huang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, LPMC & KLMDASR, Tianjin, Peoples R China
Huang, Jian
Lin, Yuanyuan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Stat, Hong Kong, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, LPMC & KLMDASR, Tianjin, Peoples R China