High-dimensional sign-constrained feature selection and grouping

被引:0
|
作者
Qin, Shanshan [1 ]
Ding, Hao [1 ]
Wu, Yuehua [1 ]
Liu, Feng [2 ]
机构
[1] York Univ, Dept Math & Stat, 4700 Keele St, Toronto, ON M3J 1P3, Canada
[2] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW 2007, Australia
基金
加拿大自然科学与工程研究理事会;
关键词
Difference convex programming; Feature grouping; Feature selection; High-dimensional; Non-negative; NONNEGATIVE LEAST-SQUARES; VARIABLE SELECTION; ADAPTIVE LASSO; REGRESSION; LIKELIHOOD; RECOVERY; MODELS; PATH;
D O I
10.1007/s10463-020-00766-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we propose a non-negative feature selection/feature grouping (nnFSG) method for general sign-constrained high-dimensional regression problems that allows regression coefficients to be disjointly homogeneous, with sparsity as a special case. To solve the resulting non-convex optimization problem, we provide an algorithm that incorporates the difference of convex programming, augmented Lagrange and coordinate descent methods. Furthermore, we show that the aforementioned nnFSG method recovers the oracle estimate consistently, and that the mean-squared errors are bounded. Additionally, we examine the performance of our method using finite sample simulations and applying it to a real protein mass spectrum dataset.
引用
收藏
页码:787 / 819
页数:33
相关论文
共 50 条
  • [1] High-dimensional sign-constrained feature selection and grouping
    Shanshan Qin
    Hao Ding
    Yuehua Wu
    Feng Liu
    Annals of the Institute of Statistical Mathematics, 2021, 73 : 787 - 819
  • [2] Sign-constrained least squares estimation for high-dimensional regression
    Meinshausen, Nicolai
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 1607 - 1631
  • [3] High-dimensional feature selection via feature grouping: A Variable Neighborhood Search approach
    Garcia-Torres, Miguel
    Gomez-Vela, Francisco
    Melian-Batista, Belen
    Marcos Moreno-Vega, J.
    INFORMATION SCIENCES, 2016, 326 : 102 - 118
  • [4] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [5] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    Computational Management Science, 2009, 6 (1) : 25 - 40
  • [6] Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data
    Aydore, Sergul
    Thirion, Bertrand
    Varoquaux, Gael
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [8] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964
  • [9] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [10] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17