Efficient Learning of Minimax Risk Classifiers in High Dimensions

被引:0
作者
Bondugula, Kartheek [1 ]
Mazuelas, Santiago [1 ,2 ]
Perez, Aritz [1 ]
机构
[1] Basque Ctr Appl Math BCAM, Bilbao, Spain
[2] IKERBASQUE Basque Fdn Sci, Bilbao, Spain
来源
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE | 2023年 / 216卷
关键词
SCALE; INFORMATION; SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional data is common in multiple areas, such as health care and genomics, where the number of features can be tens of thousands. In such scenarios, the large number of features often leads to inefficient learning. Constraint generation methods have recently enabled efficient learning of L1-regularized support vector machines (SVMs). In this paper, we leverage such methods to obtain an efficient learning algorithm for the recently proposed minimax risk classifiers (MRCs). The proposed iterative algorithm also provides a sequence of worst-case error probabilities and performs feature selection. Experiments on multiple high-dimensional datasets show that the proposed algorithm is efficient in high-dimensional scenarios. In addition, the worst-case error probability provides useful information about the classifier performance, and the features selected by the algorithm are competitive with the state-of-the-art.
引用
收藏
页码:206 / 215
页数:10
相关论文
共 29 条
[1]  
[Anonymous], 2007, P 24 INT C MACHINE L
[2]  
Bertsimas D., 1997, Introduction to Linear Optimization
[3]  
Bondugula K, 2024, Arxiv, DOI arXiv:2108.01952
[4]  
Brown G, 2012, J MACH LEARN RES, V13, P27
[5]   FUNDAMENTAL BARRIERS TO HIGH-DIMENSIONAL REGRESSION WITH CONVEX PENALTIES [J].
Celentano, Michael ;
Montanari, Andrea .
ANNALS OF STATISTICS, 2022, 50 (01) :170-196
[6]  
Dedieu A, 2022, J MACH LEARN RES, V23
[7]  
Desrosiers Jacques., 2005, A primer in column generation
[8]   Minimum redundancy feature selection from microarray gene expression data [J].
Ding, C ;
Peng, HC .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :523-528
[9]  
Dua D, 2017, UCI MACHINE LEARNING
[10]   Enriched Random Forest for High Dimensional Genomic Data [J].
Ghosh, Debopriya ;
Cabrera, Javier .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (05) :2817-2828