Robust and flexible learning of a high-dimensional classification rule using auxiliary outcomes

被引:0
作者
Liang, Muxuan [1 ]
Park, Jaeyoung [2 ]
Lu, Qing [1 ]
Zhong, Xiang [3 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, 5th Floor CTRB, Gainesville, FL 32611 USA
[2] Univ Cent Florida, Sch Global Hlth Management & Informat, Orlando, FL 32816 USA
[3] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
关键词
auxiliary outcomes; classification; high-dimensional data; multi-task learning; transfer learning; MULTITASK; ALGORITHMS; PREDICT;
D O I
10.1093/biomtc/ujae144
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated outcomes are common in many practical problems. In some settings, one outcome is of particular interest, and others are auxiliary. To leverage information shared by all the outcomes, traditional multi-task learning (MTL) minimizes an averaged loss function over all the outcomes, which may lead to biased estimation for the target outcome, especially when the MTL model is misspecified. In this work, based on a decomposition of estimation bias into two types, within-subspace and against-subspace, we develop a robust transfer learning approach to estimating a high-dimensional linear decision rule for the outcome of interest with the presence of auxiliary outcomes. The proposed method includes an MTL step using all outcomes to gain efficiency and a subsequent calibration step using only the outcome of interest to correct both types of biases. We show that the final estimator can achieve a lower estimation error than the one using only the single outcome of interest. Simulations and real data analysis are conducted to justify the superiority of the proposed method.
引用
收藏
页数:9
相关论文
共 50 条
  • [11] Deep learning approach for cancer subtype classification using high-dimensional gene expression data
    Jiquan Shen
    Jiawei Shi
    Junwei Luo
    Haixia Zhai
    Xiaoyan Liu
    Zhengjiang Wu
    Chaokun Yan
    Huimin Luo
    BMC Bioinformatics, 23
  • [12] A Hierarchical Genetic Fuzzy Rule-Based Classifier for High-Dimensional Classification Problems
    Stavrakoudis, Dimitris G.
    Gitas, Ioannis Z.
    Theocharis, John B.
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1279 - 1285
  • [13] SENSING-AWARE CLASSIFICATION WITH HIGH-DIMENSIONAL DATA
    Orten, Burkay
    Ishwar, Prakash
    Karl, W. Clem
    Saligrama, Venkatesh
    Pien, Homer
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3700 - 3703
  • [14] Characterizing the scale dimension of a high-dimensional classification problem
    Marchette, DJ
    Priebe, CE
    PATTERN RECOGNITION, 2003, 36 (01) : 45 - 60
  • [15] Deep learning approach for cancer subtype classification using high-dimensional gene expression data
    Shen, Jiquan
    Shi, Jiawei
    Luo, Junwei
    Zhai, Haixia
    Liu, Xiaoyan
    Wu, Zhengjiang
    Yan, Chaokun
    Luo, Huimin
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [16] HIGH-DIMENSIONAL CLASSIFICATION USING FEATURES ANNEALED INDEPENDENCE RULES
    Fan, Jianqing
    Fan, Yingying
    ANNALS OF STATISTICS, 2008, 36 (06) : 2605 - 2637
  • [17] GP-COACH: Genetic Programming-based learning of COmpact and ACcurate fuzzy rule-based classification systems for High-dimensional problems
    Berlanga, F. J.
    Rivera, A. J.
    del Jesus, M. J.
    Herrera, F.
    INFORMATION SCIENCES, 2010, 180 (08) : 1183 - 1200
  • [18] Accurate classification of depression through optimized machine learning models on high-dimensional noisy data
    Fang, Xingang
    Klawohn, Julia
    De Sabatino, Alexander
    Kundnani, Harsh
    Ryan, Jonathan
    Yu, Weikuan
    Hajcak, Greg
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
  • [19] Classification of sparse high-dimensional vectors
    Ingster, Yuri I.
    Pouet, Christophe
    Tsybakov, Alexandre B.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2009, 367 (1906): : 4427 - 4448
  • [20] Classification with High-Dimensional Sparse Samples
    Huang, Dayu
    Meyn, Sean
    2012 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2012,