Best subset selection;
high-dimensional regression;
L-0;
minimization;
feature selection;
REGRESSION;
D O I:
10.1214/18-AOS1733
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
Standard penalized methods of variable selection and parameter estimation rely on the magnitude of coefficient estimates to decide which variables to include in the final model. However, coefficient estimates are unreliable when the design matrix is collinear. To overcome this challenge, an entirely new perspective on variable selection is presented within a generalized fiducial inference framework. This new procedure is able to effectively account for linear dependencies among subsets of covariates in a high-dimensional setting where p can grow almost exponentially in n, as well as in the classical setting where p <= n. It is shown that the procedure very naturally assigns small probabilities to subsets of covariates which include redundancies by way of explicit L-0 minimization Furthermore, with a typical sparsity assumption, it is shown that the proposed method is consistent in the sense that the probability of the true sparse subset of covariates converges in probability to 1 as n -> infinity, or as n -> infinity and p -> infinity. Very reasonable conditions are needed, and little restriction is placed on the class of possible subsets of covariates to achieve this consistency result.
机构:
Univ Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, SpainUniv Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, Spain
Garcia-Torres, Miguel
Gomez-Vela, Francisco
论文数: 0引用数: 0
h-index: 0
机构:
Univ Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, SpainUniv Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, Spain
Gomez-Vela, Francisco
Melian-Batista, Belen
论文数: 0引用数: 0
h-index: 0
机构:
Univ La Laguna, Dept Ingn Informat & Sistemas, San Cristobal la Laguna 38271, SpainUniv Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, Spain
Melian-Batista, Belen
Marcos Moreno-Vega, J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ La Laguna, Dept Ingn Informat & Sistemas, San Cristobal la Laguna 38271, SpainUniv Pablo Olavide, Area Lenguajes & Sistemas Informat, Seville 41013, Spain
机构:
School of Statistics and Management, Shanghai University of Finance and Economics
Key Laboratory of Mathematical Economics (SUFE), Ministry of EducationSchool of Statistics and Management, Shanghai University of Finance and Economics
Jianhua Hu
Jian Huang
论文数: 0引用数: 0
h-index: 0
机构:
Department of Biostatistics, University of IowaSchool of Statistics and Management, Shanghai University of Finance and Economics
Jian Huang
Feng Qiu
论文数: 0引用数: 0
h-index: 0
机构:
School of Statistics and Management, Shanghai University of Finance and Economics
Science College, Zhejiang Agriculture and Forestry UniversitySchool of Statistics and Management, Shanghai University of Finance and Economics
机构:
Shenzhen Univ, Coll Math & Stat, Inst Stat Sci, Shenzhen 518060, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Inst Stat Sci, Shenzhen 518060, Peoples R China
Lin, Bingqing
Pang, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Hong Kong, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Inst Stat Sci, Shenzhen 518060, Peoples R China
Pang, Zhen
Wang, Qihua
论文数: 0引用数: 0
h-index: 0
机构:
Shenzhen Univ, Coll Math & Stat, Inst Stat Sci, Shenzhen 518060, Peoples R China
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Inst Stat Sci, Shenzhen 518060, Peoples R China
机构:
Chongqing Technol & Business Univ, Sch Math & Stat, Chongqing 400067, Peoples R China
Chongqing Technol & Business Univ, Chongqing Key Lab Stat Intelligent Comp & Monitori, Chongqing 400067, Peoples R ChinaChongqing Technol & Business Univ, Sch Math & Stat, Chongqing 400067, Peoples R China
Yang, Yiping
Zhao, Peixin
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Technol & Business Univ, Sch Math & Stat, Chongqing 400067, Peoples R China
Chongqing Technol & Business Univ, Chongqing Key Lab Stat Intelligent Comp & Monitori, Chongqing 400067, Peoples R ChinaChongqing Technol & Business Univ, Sch Math & Stat, Chongqing 400067, Peoples R China
Zhao, Peixin
Zhang, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Shenzhen Univ, Sch Math Sci, Shenzhen 518060, Peoples R ChinaChongqing Technol & Business Univ, Sch Math & Stat, Chongqing 400067, Peoples R China
机构:
Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
Minist Educ, Key Lab Math Econ SUFE, Shanghai 200433, Peoples R ChinaShanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
Hu, Jianhua
Huang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Univ Iowa, Dept Biostat, Iowa City, IA 52242 USAShanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
Huang, Jian
Qiu, Feng
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
Zhejiang Agr & Forestry Univ, Sci Coll, Hangzhou 311300, Zhejiang, Peoples R ChinaShanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
机构:
Univ Melbourne, Melbourne Business Sch, Carlton, Vic 3053, AustraliaUniv Melbourne, Melbourne Business Sch, Carlton, Vic 3053, Australia
Ando, Tomohiro
Li, Ker-Chau
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
Acad Sinica, Inst Stat Sci, Taipei 11529, TaiwanUniv Melbourne, Melbourne Business Sch, Carlton, Vic 3053, Australia