Mallows model averaging with effective model size in fragmentary data prediction

被引:3
|
作者
Yuan, Chaoxia [1 ]
Fang, Fang [1 ,2 ]
Ni, Lyu [3 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci MOE, Shanghai, Peoples R China
[3] East China Normal Univ, Sch Data Sci & Engn, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Asymptotic optimality; Effective model size; Fragmentary data; Multiple data sources; Mallows model averaging; GENERALIZED LINEAR-MODELS; ASYMPTOTIC OPTIMALITY; SELECTION; REGRESSION;
D O I
10.1016/j.csda.2022.107497
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Most existing model averaging methods consider fully observed data while fragmentary data, in which not all the covariate data are available for many subjects, becomes more and more popular nowadays with the increasing data sources in many areas such as economics, social sciences and medical studies. The main challenge of model averaging in fragmentary data is that the samples to fit candidate models are different to the sample used for weight selection, which introduces bias to the Mallows criterion in the classical Mallows Model Averaging (MMA). A novel Mallows model averaging method that utilizes the "effective model size " taking different samples into consideration is proposed and its asymptotic optimality is established. Empirical evidences from a simulation study and a real data analysis are presented. The proposed Effective Mallows Model Averaging (EMMA) method not only provides a novel solution to the fragmentary data prediction, but also sheds light on model selection when candidate models have different sample sizes, which has rarely been discussed in the literature. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Mallows model averaging based on kernel regression imputation with responses missing at random
    Zhu, Hengkun
    Zou, Guohua
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 231
  • [12] On the sparsity of Mallows model averaging estimator
    Feng, Yang
    Liu, Qingfeng
    Okui, Ryo
    ECONOMICS LETTERS, 2020, 187
  • [13] A Mallows-Type Model Averaging Estimator for the Varying-Coefficient Partially Linear Model
    Zhu, Rong
    Wan, Alan T. K.
    Zhang, Xinyu
    Zou, Guohua
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 882 - 892
  • [14] Partial Linear Model Averaging Prediction for Longitudinal Data
    Li Na
    Fei Yu
    Zhang Xinyu
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2024, 37 (02) : 863 - 885
  • [15] Spatial Mallows model averaging for geostatistical models
    Liao, Jun
    Zou, Guohua
    Gao, Yan
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2019, 47 (03): : 336 - 351
  • [16] A Mallows-type model averaging estimator for ridge regression with randomly right censored data
    Zeng, Jie
    Hu, Guozhi
    Cheng, Weihu
    STATISTICS AND COMPUTING, 2024, 34 (05)
  • [17] Asymptotic optimality of generalized cross validation and regularized Mallows model averaging
    Zou, Chenchen
    Li, Xin
    Li, Xinmin
    Liang, Hua
    STATISTICS & PROBABILITY LETTERS, 2025, 222
  • [18] Partial Linear Model Averaging Prediction for Longitudinal Data
    Na Li
    Yu Fei
    Xinyu Zhang
    Journal of Systems Science and Complexity, 2024, 37 : 863 - 885
  • [19] Communication-efficient model averaging prediction for massive data with asymptotic optimality
    Xia, Xiaochao
    He, Sijin
    Pang, Naiwen
    STATISTICAL PAPERS, 2025, 66 (02)
  • [20] An Empirical Investigation on the Forecasting Ability of Mallows Model Averaging In a Macro Economic Environment
    Yin, Yip Chee
    Hock-Eam, Lim
    INTERNATIONAL CONFERENCE ON FUNDAMENTAL AND APPLIED SCIENCES 2012 (ICFAS2012), 2012, 1482 : 402 - 407