Mallows model averaging with effective model size in fragmentary data prediction

被引:3
|
作者
Yuan, Chaoxia [1 ]
Fang, Fang [1 ,2 ]
Ni, Lyu [3 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci MOE, Shanghai, Peoples R China
[3] East China Normal Univ, Sch Data Sci & Engn, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Asymptotic optimality; Effective model size; Fragmentary data; Multiple data sources; Mallows model averaging; GENERALIZED LINEAR-MODELS; ASYMPTOTIC OPTIMALITY; SELECTION; REGRESSION;
D O I
10.1016/j.csda.2022.107497
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Most existing model averaging methods consider fully observed data while fragmentary data, in which not all the covariate data are available for many subjects, becomes more and more popular nowadays with the increasing data sources in many areas such as economics, social sciences and medical studies. The main challenge of model averaging in fragmentary data is that the samples to fit candidate models are different to the sample used for weight selection, which introduces bias to the Mallows criterion in the classical Mallows Model Averaging (MMA). A novel Mallows model averaging method that utilizes the "effective model size " taking different samples into consideration is proposed and its asymptotic optimality is established. Empirical evidences from a simulation study and a real data analysis are presented. The proposed Effective Mallows Model Averaging (EMMA) method not only provides a novel solution to the fragmentary data prediction, but also sheds light on model selection when candidate models have different sample sizes, which has rarely been discussed in the literature. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Least Squares Model Averaging for Distributed Data
    Zhang, Haili
    Liu, Zhaobo
    Zou, Guohua
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [42] Model averaging with covariates that are missing completely at random
    Zhang, Xinyu
    ECONOMICS LETTERS, 2013, 121 (03) : 360 - 363
  • [43] Varying-coefficient semiparametric model averaging prediction
    Li, Jialiang
    Xia, Xiaochao
    Wong, Weng Kee
    Nott, David
    BIOMETRICS, 2018, 74 (04) : 1417 - 1426
  • [44] Model Averaging Under Flexible Loss Functions
    Gu, Dieqi
    Liu, Qingfeng
    Zhang, Xinyu
    INFORMS JOURNAL ON COMPUTING, 2025,
  • [45] Optimal Model Averaging for Semiparametric Partially Linear Models with Censored Data
    Hu, Guozhi
    Cheng, Weihu
    Zeng, Jie
    MATHEMATICS, 2023, 11 (03)
  • [46] Model averaging in predictive regressions
    Liu, Chu-An
    Kuo, Biing-Shen
    ECONOMETRICS JOURNAL, 2016, 19 (02) : 203 - 231
  • [47] Model averaging: A shrinkage perspective
    Peng, Jingfu
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 3535 - 3572
  • [48] OPTIMAL MODEL AVERAGING ESTIMATION FOR PARTIALLY LINEAR MODELS
    Zhang, Xinyu
    Wang, Wendun
    STATISTICA SINICA, 2019, 29 (02) : 693 - 718
  • [49] Frequentist model averaging under a linear exponential loss
    Li, Xinmin
    Liang, Hua
    Liu, Huihang
    Tong, Tingting
    Xie, Tian
    STATISTICS AND COMPUTING, 2025, 35 (03)
  • [50] OPTIMAL MODEL AVERAGING BASED ON GENERALIZED METHOD OF MOMENTS
    Zhang, Xinyu
    STATISTICA SINICA, 2021, 31 (04) : 2103 - 2122