Mallows model averaging with effective model size in fragmentary data prediction

被引：3

作者：

Yuan, Chaoxia ^{[1
]}

Fang, Fang ^{[1
,2
]}

Ni, Lyu ^{[3
]}

机构：

[1] East China Normal Univ, Sch Stat, Shanghai, Peoples R China

[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci MOE, Shanghai, Peoples R China

[3] East China Normal Univ, Sch Data Sci & Engn, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China

来源：

COMPUTATIONAL STATISTICS & DATA ANALYSIS | 2022年 / 173卷

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Asymptotic optimality; Effective model size; Fragmentary data; Multiple data sources; Mallows model averaging; GENERALIZED LINEAR-MODELS; ASYMPTOTIC OPTIMALITY; SELECTION; REGRESSION;

D O I：

10.1016/j.csda.2022.107497

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Most existing model averaging methods consider fully observed data while fragmentary data, in which not all the covariate data are available for many subjects, becomes more and more popular nowadays with the increasing data sources in many areas such as economics, social sciences and medical studies. The main challenge of model averaging in fragmentary data is that the samples to fit candidate models are different to the sample used for weight selection, which introduces bias to the Mallows criterion in the classical Mallows Model Averaging (MMA). A novel Mallows model averaging method that utilizes the "effective model size " taking different samples into consideration is proposed and its asymptotic optimality is established. Empirical evidences from a simulation study and a real data analysis are presented. The proposed Effective Mallows Model Averaging (EMMA) method not only provides a novel solution to the fragmentary data prediction, but also sheds light on model selection when candidate models have different sample sizes, which has rarely been discussed in the literature. (C)& nbsp;2022 Elsevier B.V. All rights reserved.

引用

页数：18

共 50 条

[41] Least Squares Model Averaging for Distributed Data
Zhang, Haili
Liu, Zhaobo
Zou, Guohua
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[42] Model averaging with covariates that are missing completely at random
Zhang, Xinyu
ECONOMICS LETTERS, 2013, 121 (03) : 360 - 363
[43] Varying-coefficient semiparametric model averaging prediction
Li, Jialiang
Xia, Xiaochao
Wong, Weng Kee
Nott, David
BIOMETRICS, 2018, 74 (04) : 1417 - 1426
[44] Model Averaging Under Flexible Loss Functions
Gu, Dieqi
Liu, Qingfeng
Zhang, Xinyu
INFORMS JOURNAL ON COMPUTING, 2025,
[45] Optimal Model Averaging for Semiparametric Partially Linear Models with Censored Data
Hu, Guozhi
Cheng, Weihu
Zeng, Jie
MATHEMATICS, 2023, 11 (03)
[46] Model averaging in predictive regressions
Liu, Chu-An
Kuo, Biing-Shen
ECONOMETRICS JOURNAL, 2016, 19 (02) : 203 - 231
[47] Model averaging: A shrinkage perspective
Peng, Jingfu
ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 3535 - 3572
[48] OPTIMAL MODEL AVERAGING ESTIMATION FOR PARTIALLY LINEAR MODELS
Zhang, Xinyu
Wang, Wendun
STATISTICA SINICA, 2019, 29 (02) : 693 - 718
[49] Frequentist model averaging under a linear exponential loss
Li, Xinmin
Liang, Hua
Liu, Huihang
Tong, Tingting
Xie, Tian
STATISTICS AND COMPUTING, 2025, 35 (03)
[50] OPTIMAL MODEL AVERAGING BASED ON GENERALIZED METHOD OF MOMENTS
Zhang, Xinyu
STATISTICA SINICA, 2021, 31 (04) : 2103 - 2122

← 1 2 3 4 5 →