Transductive versions of the LASSO and the Dantzig Selector

被引:4
作者
Alquier, Pierre [1 ,2 ]
Hebiri, Mohamed [3 ]
机构
[1] Univ Paris 07, LPMA, F-75013 Paris, France
[2] CREST LS, F-92240 Malakoff, France
[3] Univ Marne la Vallee, Dept Math, F-77454 Marne La Vallee 2, France
关键词
High-dimensional data; LASSO; Sparsity; High-dimensional regression estimation; Variable selection; Transduction; VARIABLE SELECTION; AGGREGATION; CONSISTENCY; EXPRESSION; RECOVERY;
D O I
10.1016/j.jspi.2012.03.020
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Transductive methods are useful in prediction problems when the training dataset is composed of a large number of unlabeled observations and a smaller number of labeled observations. In this paper, we propose an approach for developing transductive prediction procedures that are able to take advantage of the sparsity in the high dimensional linear regression. More precisely, we define transductive versions of the LASSO (Tibshirani, 1996) and the Dantzig Selector (Candes and Tao, 2007). These procedures combine labeled and unlabeled observations of the training dataset to produce a prediction for the unlabeled observations. We propose an experimental study of the transductive estimators that shows that they improve the LASSO and Dantzig Selector in many situations, and particularly in high dimensional problems when the predictors are correlated. We then provide non-asymptotic theoretical guarantees for these estimation methods. Interestingly, our theoretical results show that the Transductive LASSO and Dantzig Selector satisfy sparsity inequalities under weaker assumptions than those required for the "original" LASSO. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2485 / 2500
页数:16
相关论文
共 50 条
  • [1] Akaike H., 1973, 2 INTERNAT SYMPOS IN, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-5_38]
  • [2] Generalization of l1 constraints for high dimensional regression problems
    Alquier, Pierre
    Hebiri, Mohamed
    [J]. STATISTICS & PROBABILITY LETTERS, 2011, 81 (12) : 1760 - 1765
  • [3] LASSO, Iterative Feature Selection and the Correlation Selector: Oracle inequalities and numerical performances
    Alquier, Pierre
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2008, 2 : 1129 - 1152
  • [4] Amini M., 2003, P 18 IJCAI
  • [5] Ando RK, 2005, J MACH LEARN RES, V6, P1817
  • [6] [Anonymous], 1999, ICML
  • [7] [Anonymous], 2005, P INT WORKSH ART INT
  • [8] [Anonymous], 2006, BOOK REV IEEE T NEUR
  • [9] [Anonymous], NATURE STATISTI810
  • [10] [Anonymous], 2003, ICML