High-Dimensional Multi-Task Learning using Multivariate Regression and Generalized Fiducial Inference

被引:0
作者
Wei, Zhenyu [1 ]
Lee, Thomas C. M. [1 ]
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Confidence ellipsoids; GMTask; Large p small n; Prediction ellipsoids; Uncertainty quantification; VARIABLE SELECTION; LINEAR-MODELS;
D O I
10.1080/10618600.2022.2090946
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Over the past decades, the Multi-Task Learning (MTL) problem has attracted much attention in the artificial intelligence and machine learning communities. However, most published work in this area focuses on point estimation; that is, estimating model parameters and/or making predictions. This article studies another important aspect of the MTL problem: uncertainty quantification for model choices and predictions. To be more specific, this article approaches the MTL problem with multivariate regression and develops a novel method for deriving a probability density function on the space of all potential regression models. With this density function, point estimates, as well as confidence and prediction ellipsoids, can be obtained for quantities of interest, such as future observations. The proposed method, termed GMTask, is based on the generalized fiducial inference (GFI) framework and is shown to enjoy desirable theoretical properties. Its promising empirical properties are illustrated via a sequence of numerical experiments and applications to two real datasets. for this article are available online.
引用
收藏
页码:226 / 240
页数:15
相关论文
共 34 条
[1]   MODEL SELECTION FOR MULTIVARIATE REGRESSION IN SMALL SAMPLES [J].
BEDRICK, EJ ;
TSAI, CL .
BIOMETRICS, 1994, 50 (01) :226-231
[2]   Consistent High-Dimensional Bayesian Variable Selection via Penalized Credible Regions [J].
Bondell, Howard D. ;
Reich, Brian J. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) :1610-1624
[3]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[4]   The Dempster-Shafer calculus for statisticians [J].
Dempster, A. P. .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 48 (02) :365-377
[5]  
Evgeniou A., 2007, ADV NEURAL INFORM PR, P1, DOI [DOI 10.7551/MITPRESS/7503.003.0010, 10.7551/mitpress/7503.003.0010]
[6]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[7]  
Fan JQ, 2012, J ROY STAT SOC B, V74, P37, DOI 10.1111/j.1467-9868.2011.01005.x
[8]  
Fisher RA, 1930, P CAMB PHILOS SOC, V26, P528
[9]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[10]   Generalized Fiducial Inference: A Review and New Results [J].
Hannig, Jan ;
Iyer, Hari ;
Lai, Randy C. S. ;
Lee, Thomas C. M. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (515) :1346-1361