Robust and flexible learning of a high-dimensional classification rule using auxiliary outcomes

被引:0
作者
Liang, Muxuan [1 ]
Park, Jaeyoung [2 ]
Lu, Qing [1 ]
Zhong, Xiang [3 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, 5th Floor CTRB, Gainesville, FL 32611 USA
[2] Univ Cent Florida, Sch Global Hlth Management & Informat, Orlando, FL 32816 USA
[3] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
关键词
auxiliary outcomes; classification; high-dimensional data; multi-task learning; transfer learning; MULTITASK; ALGORITHMS; PREDICT;
D O I
10.1093/biomtc/ujae144
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated outcomes are common in many practical problems. In some settings, one outcome is of particular interest, and others are auxiliary. To leverage information shared by all the outcomes, traditional multi-task learning (MTL) minimizes an averaged loss function over all the outcomes, which may lead to biased estimation for the target outcome, especially when the MTL model is misspecified. In this work, based on a decomposition of estimation bias into two types, within-subspace and against-subspace, we develop a robust transfer learning approach to estimating a high-dimensional linear decision rule for the outcome of interest with the presence of auxiliary outcomes. The proposed method includes an MTL step using all outcomes to gain efficiency and a subsequent calibration step using only the outcome of interest to correct both types of biases. We show that the final estimator can achieve a lower estimation error than the one using only the single outcome of interest. Simulations and real data analysis are conducted to justify the superiority of the proposed method.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Feature selection for high-dimensional classification using a competitive swarm optimizer
    Shenkai Gu
    Ran Cheng
    Yaochu Jin
    Soft Computing, 2018, 22 : 811 - 822
  • [32] Feature selection for high-dimensional classification using a competitive swarm optimizer
    Gu, Shenkai
    Cheng, Ran
    Jin, Yaochu
    SOFT COMPUTING, 2018, 22 (03) : 811 - 822
  • [33] Improved Algorithms for High-dimensional Robust PCA
    Lin, Xiaoyong
    Zhang, Zeqiu
    Wang, Jue
    Zhang, Zhaoyang
    Qiu, Tingting
    Mi, Zhengkun
    2016 IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS WIRELESS BROADBAND (ICUWB2016), 2016,
  • [35] Learning high-dimensional multimedia data
    Xiaofeng Zhu
    Zhi Jin
    Rongrong Ji
    Multimedia Systems, 2017, 23 : 281 - 283
  • [36] A Compressive Classification Framework for High-Dimensional Data
    Tabassum, Muhammad Naveed
    Ollila, Esa
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 177 - 186
  • [37] A training algorithm for classification of high-dimensional data
    Vieira, A
    Barradas, N
    NEUROCOMPUTING, 2003, 50 : 461 - 472
  • [38] Online Nonlinear Classification for High-Dimensional Data
    Vanli, N. Denizcan
    Ozkan, Huseyin
    Delibalta, Ibrahim
    Kozat, Suleyman S.
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 685 - 688
  • [39] hGA: Hybrid genetic algorithm in fuzzy rule-based classification systems for high-dimensional problems
    Aydogan, Emel Kizilkaya
    Karaoglan, Ismail
    Pardalos, Panos M.
    APPLIED SOFT COMPUTING, 2012, 12 (02) : 800 - 806
  • [40] Learning high-dimensional multimedia data
    Zhu, Xiaofeng
    Jin, Zhi
    Ji, Rongrong
    MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283