Robust and flexible learning of a high-dimensional classification rule using auxiliary outcomes

被引：0

作者：

Liang, Muxuan ^{[1
]}

Park, Jaeyoung ^{[2
]}

Lu, Qing ^{[1
]}

Zhong, Xiang ^{[3
]}

机构：

[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, 5th Floor CTRB, Gainesville, FL 32611 USA

[2] Univ Cent Florida, Sch Global Hlth Management & Informat, Orlando, FL 32816 USA

[3] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA

来源：

BIOMETRICS | 2024年 / 80卷 / 04期

关键词：

auxiliary outcomes; classification; high-dimensional data; multi-task learning; transfer learning; MULTITASK; ALGORITHMS; PREDICT;

D O I：

10.1093/biomtc/ujae144

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Correlated outcomes are common in many practical problems. In some settings, one outcome is of particular interest, and others are auxiliary. To leverage information shared by all the outcomes, traditional multi-task learning (MTL) minimizes an averaged loss function over all the outcomes, which may lead to biased estimation for the target outcome, especially when the MTL model is misspecified. In this work, based on a decomposition of estimation bias into two types, within-subspace and against-subspace, we develop a robust transfer learning approach to estimating a high-dimensional linear decision rule for the outcome of interest with the presence of auxiliary outcomes. The proposed method includes an MTL step using all outcomes to gain efficiency and a subsequent calibration step using only the outcome of interest to correct both types of biases. We show that the final estimator can achieve a lower estimation error than the one using only the single outcome of interest. Simulations and real data analysis are conducted to justify the superiority of the proposed method.

引用

页数：9

共 50 条

[31] Feature selection for high-dimensional classification using a competitive swarm optimizer
Shenkai Gu
Ran Cheng
Yaochu Jin
Soft Computing, 2018, 22 : 811 - 822
[32] Feature selection for high-dimensional classification using a competitive swarm optimizer
Gu, Shenkai
Cheng, Ran
Jin, Yaochu
SOFT COMPUTING, 2018, 22 (03) : 811 - 822
[33] Improved Algorithms for High-dimensional Robust PCA
Lin, Xiaoyong
Zhang, Zeqiu
Wang, Jue
Zhang, Zhaoyang
Qiu, Tingting
Mi, Zhengkun
2016 IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS WIRELESS BROADBAND (ICUWB2016), 2016,
[34] Learning from High-Dimensional and Class-Imbalanced Datasets Using Random Forests
Pes, Barbara
INFORMATION, 2021, 12 (08)
[35] Learning high-dimensional multimedia data
Xiaofeng Zhu
Zhi Jin
Rongrong Ji
Multimedia Systems, 2017, 23 : 281 - 283
[36] A Compressive Classification Framework for High-Dimensional Data
Tabassum, Muhammad Naveed
Ollila, Esa
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 177 - 186
[37] A training algorithm for classification of high-dimensional data
Vieira, A
Barradas, N
NEUROCOMPUTING, 2003, 50 : 461 - 472
[38] Online Nonlinear Classification for High-Dimensional Data
Vanli, N. Denizcan
Ozkan, Huseyin
Delibalta, Ibrahim
Kozat, Suleyman S.
2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 685 - 688
[39] hGA: Hybrid genetic algorithm in fuzzy rule-based classification systems for high-dimensional problems
Aydogan, Emel Kizilkaya
Karaoglan, Ismail
Pardalos, Panos M.
APPLIED SOFT COMPUTING, 2012, 12 (02) : 800 - 806
[40] Learning high-dimensional multimedia data
Zhu, Xiaofeng
Jin, Zhi
Ji, Rongrong
MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283

← 1 2 3 4 5 →