Robust and flexible learning of a high-dimensional classification rule using auxiliary outcomes

被引:0
作者
Liang, Muxuan [1 ]
Park, Jaeyoung [2 ]
Lu, Qing [1 ]
Zhong, Xiang [3 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, 5th Floor CTRB, Gainesville, FL 32611 USA
[2] Univ Cent Florida, Sch Global Hlth Management & Informat, Orlando, FL 32816 USA
[3] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
关键词
auxiliary outcomes; classification; high-dimensional data; multi-task learning; transfer learning; MULTITASK; ALGORITHMS; PREDICT;
D O I
10.1093/biomtc/ujae144
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated outcomes are common in many practical problems. In some settings, one outcome is of particular interest, and others are auxiliary. To leverage information shared by all the outcomes, traditional multi-task learning (MTL) minimizes an averaged loss function over all the outcomes, which may lead to biased estimation for the target outcome, especially when the MTL model is misspecified. In this work, based on a decomposition of estimation bias into two types, within-subspace and against-subspace, we develop a robust transfer learning approach to estimating a high-dimensional linear decision rule for the outcome of interest with the presence of auxiliary outcomes. The proposed method includes an MTL step using all outcomes to gain efficiency and a subsequent calibration step using only the outcome of interest to correct both types of biases. We show that the final estimator can achieve a lower estimation error than the one using only the single outcome of interest. Simulations and real data analysis are conducted to justify the superiority of the proposed method.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] A classification algorithm for high-dimensional data
    Roy, Asim
    INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 345 - 355
  • [22] Estimation of predictive performance in high-dimensional data settings using learning curves
    Goedhart, Jeroen M.
    Klausch, Thomas
    van de Wiel, Mark A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [23] Robust transfer learning for high-dimensional quantile regression model with linear constraints
    Longjie Cao
    Yunquan Song
    Applied Intelligence, 2024, 54 : 1263 - 1274
  • [24] Robust transfer learning for high-dimensional quantile regression model with linear constraints
    Cao, Longjie
    Song, Yunquan
    APPLIED INTELLIGENCE, 2024, 54 (02) : 1263 - 1274
  • [25] Robust tests of the equality of two high-dimensional covariance matrices
    Zi, Xuemin
    Chen, Hui
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (10) : 3120 - 3141
  • [26] Multivariate Feature Ranking With High-Dimensional Data for Classification Tasks
    Jimenez, Fernando
    Sanchez, Gracia
    Palma, Jose
    Miralles-Pechuan, Luis
    Botia, Juan A.
    IEEE ACCESS, 2022, 10 : 60421 - 60437
  • [27] On the orthogonal distance to class subspaces for high-dimensional data classification
    Zhu, Rui
    Xue, Jing-Hao
    INFORMATION SCIENCES, 2017, 417 : 262 - 273
  • [28] Global Binary Optimization on Graphs for Classification of High-Dimensional Data
    Merkurjev, Ekaterina
    Bae, Egil
    Bertozzi, Andrea L.
    Tai, Xue-Cheng
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2015, 52 (03) : 414 - 435
  • [29] Multivariate functional subspace classification for high-dimensional longitudinal data
    Fukuda, Tatsuya
    Matsui, Hidetoshi
    Takada, Hiroya
    Misumi, Toshihiro
    Konishi, Sadanori
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2024, 7 (01) : 1 - 16
  • [30] PETER HALL'S WORK ON HIGH-DIMENSIONAL DATA AND CLASSIFICATION
    Samworth, Richard J.
    ANNALS OF STATISTICS, 2016, 44 (05) : 1888 - 1895