Robust and flexible learning of a high-dimensional classification rule using auxiliary outcomes

被引:0
作者
Liang, Muxuan [1 ]
Park, Jaeyoung [2 ]
Lu, Qing [1 ]
Zhong, Xiang [3 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, 5th Floor CTRB, Gainesville, FL 32611 USA
[2] Univ Cent Florida, Sch Global Hlth Management & Informat, Orlando, FL 32816 USA
[3] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
关键词
auxiliary outcomes; classification; high-dimensional data; multi-task learning; transfer learning; MULTITASK; ALGORITHMS; PREDICT;
D O I
10.1093/biomtc/ujae144
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated outcomes are common in many practical problems. In some settings, one outcome is of particular interest, and others are auxiliary. To leverage information shared by all the outcomes, traditional multi-task learning (MTL) minimizes an averaged loss function over all the outcomes, which may lead to biased estimation for the target outcome, especially when the MTL model is misspecified. In this work, based on a decomposition of estimation bias into two types, within-subspace and against-subspace, we develop a robust transfer learning approach to estimating a high-dimensional linear decision rule for the outcome of interest with the presence of auxiliary outcomes. The proposed method includes an MTL step using all outcomes to gain efficiency and a subsequent calibration step using only the outcome of interest to correct both types of biases. We show that the final estimator can achieve a lower estimation error than the one using only the single outcome of interest. Simulations and real data analysis are conducted to justify the superiority of the proposed method.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Flexible High-Dimensional Classification Machines and Their Asymptotic Properties
    Qiao, Xingye
    Zhang, Lingsong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 1547 - 1572
  • [2] Robust Classification of High-Dimensional Data Using Data-Adaptive Energy Distance
    Choudhury, Jyotishka Ray
    Saha, Aytijhya
    Roy, Sarbojit
    Dutta, Subhajit
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT V, 2023, 14173 : 86 - 101
  • [3] Boosting for Vote Learning in High-dimensional kNN Classification
    Tomasev, Nenad
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 676 - 683
  • [4] A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning
    Alcala-Fdez, Jesus
    Alcala, Rafael
    Herrera, Francisco
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (05) : 857 - 872
  • [5] Robust transfer learning of high-dimensional generalized linear model
    Sun, Fei
    Zhang, Qi
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 618
  • [6] Robust transfer learning for high-dimensional regression with linear constraints
    Chen, Xuan
    Song, Yunquan
    Wang, Yuanfeng
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (11) : 2462 - 2482
  • [7] Penalized Independence Rule for Testing High-Dimensional Hypotheses
    Shen, Yanfeng
    Lin, Zhengyan
    Zhu, Jun
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2011, 40 (13) : 2424 - 2435
  • [8] Learning from High-Dimensional Data in Multitasli/Multilabel Classification
    Kwok, James T.
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 16 - 17
  • [9] Ensemble Method for Classification of High-Dimensional Data
    Piao, Yongjun
    Park, Hyun Woo
    Jin, Cheng Hao
    Ryu, Keun Ho
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 245 - +
  • [10] Robust adaptive LASSO in high-dimensional logistic regression
    Basu, Ayanendranath
    Ghosh, Abhik
    Jaenada, Maria
    Pardo, Leandro
    STATISTICAL METHODS AND APPLICATIONS, 2024, : 1217 - 1249