Regularized Multivariate Analysis Framework for Interpretable High-Dimensional Variable Selection

被引:10
|
作者
Munoz-Romero, Sergio [1 ]
Gomez-Verdejo, Vanessa [2 ]
Arenas-Garcia, Jernimo [2 ]
机构
[1] Univ Rey Juan Carlos, Dept Signal Proc & Commun, Madrid, Spain
[2] Univ Carlos III Madrid, Dept Signal Proc & Commun, E-28903 Getafe, Spain
关键词
SPARSE; REGRESSION;
D O I
10.1109/MCI.2016.2601701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multivariate Analysis (MVA) comprises a family of well-known methods for feature extraction which exploit correlations among input variables representing the data. One important property that is enjoyed by most such methods is uncorrelation among the extracted features. Recently, regularized versions of MVA methods have appeared in the literature, mainly with the goal to gain interpretability of the solution. In these cases, the solutions can no longer be obtained in a closed manner, and more complex optimization methods that rely on the iteration of two steps are frequently used. This paper recurs to an alternative approach to solve efficiently this iterative problem. The main novelty of this approach lies in preserving several properties of the original methods, most notably the uncorrelation of the extracted features. Under this framework, we propose a novel method that takes advantage of the,2,1 norm to perform variable selection during the feature extraction process. Experimental results over different problems corroborate the advantages of the proposed formulation in comparison to state of the art formulations.
引用
收藏
页码:24 / 35
页数:12
相关论文
共 50 条
  • [31] PUlasso: High-Dimensional Variable Selection With Presence-Only Data
    Song, Hyebin
    Raskutti, Garvesh
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 334 - 347
  • [32] Determining and Depicting Relationships Among Components in High-Dimensional Variable Selection
    Hall, Peter
    Miller, Hugh
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) : 988 - 1006
  • [33] Consistent Variable Selection for High-dimensional Nonparametric Additive Nonlinear Systems
    Mu, Biqiang
    Zheng, Wei Xing
    Bai, Er-Wei
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 3066 - 3071
  • [34] Automatic model selection for high-dimensional survival analysis
    Lang, M.
    Kotthaus, H.
    Marwedel, P.
    Weihs, C.
    Rahnenfuehrer, J.
    Bischl, B.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 62 - 76
  • [35] Optimal Feature Selection in High-Dimensional Discriminant Analysis
    Kolar, Mladen
    Liu, Han
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) : 1063 - 1083
  • [36] Variable Selection in High-dimensional Varying-coefficient Models with Global Optimality
    Xue, Lan
    Qu, Annie
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 1973 - 1998
  • [37] Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics
    Li, Fan
    Zhang, Nancy R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (491) : 1202 - 1214
  • [38] Structural identification and variable selection in high-dimensional varying-coefficient models
    Chen, Yuping
    Bai, Yang
    Fung, Wingkam
    JOURNAL OF NONPARAMETRIC STATISTICS, 2017, 29 (02) : 258 - 279
  • [39] Efficient test-based variable selection for high-dimensional linear models
    Gong, Siliang
    Zhang, Kai
    Liu, Yufeng
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 166 : 17 - 31
  • [40] Controlled variable selection in Weibull mixture cure models for high-dimensional data
    Fu, Han
    Nicolet, Deedra
    Mrozek, Krzysztof
    Stone, Richard M.
    Eisfeld, Ann-Kathrin
    Byrd, John C.
    Archer, Kellie J.
    STATISTICS IN MEDICINE, 2022, 41 (22) : 4340 - 4366