Model Selection Techniques An overview

被引:209
作者
Ding, Jie [1 ,2 ,3 ]
Tarokh, Vahid [4 ,5 ,6 ,7 ,8 ,9 ,10 ]
Yang, Yuhong [11 ,12 ,13 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] Duke Univ, Informat Initiat, Durham, NC USA
[3] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[4] AT&T Labs Res, Atlanta, GA USA
[5] AT&T Wireless Serv, Atlanta, GA USA
[6] Dept Wireless Commun & Signal Proc, Atlanta, GA USA
[7] MIT, Cambridge, MA 02139 USA
[8] Harvard Univ, Elect Engn, Cambridge, MA 02138 USA
[9] Duke Univ, Elect & Comp Engn Comp Sci & Mathematics, Durham, NC 27706 USA
[10] CALTECH, Pasadena, CA 91125 USA
[11] Iowa State Univ, Dept Stat, Ames, IA USA
[12] Univ Minnesota, Minneapolis, MN USA
[13] Inst Math Stat, Bethesda, MD USA
关键词
AKAIKES INFORMATION CRITERION; MINIMUM DESCRIPTION LENGTH; VARIABLE SELECTION; ORDER SELECTION; MULTIPLE-REGRESSION; ORACLE PROPERTIES; DANTZIG SELECTOR; CROSS-VALIDATION; TIME-SERIES; LASSO;
D O I
10.1109/MSP.2018.2867638
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the era of big data, analysts usually explore various statistical models or machine-learning methods for observed data to facilitate scientific discoveries or gain predictive power. Whatever data and fitting procedures are employed, a crucial step is to select the most appropriate model or method from a set of candidates. Model selection is a key ingredient in data analysis for reliable and reproducible statistical inference or prediction, and thus it is central to scientific studies in such fields as ecology, economics, engineering, finance, political science, biology, and epidemiology. There has been a long history of model selection techniques that arise from researches in statistics, information theory, and signal processing. A considerable number of methods has been proposed, following different philosophies and exhibiting varying performances. The purpose of this article is to provide a comprehensive overview of them, in terms of their motivation, large sample performance, and applicability. We provide integrated and practically relevant discussions on theoretical properties of state-of-the-art model selection approaches. We also share our thoughts on some controversial views on the practice of model selection. © 2018 IEEE.
引用
收藏
页码:16 / 34
页数:19
相关论文
共 90 条
[71]   Bayesian measures of model complexity and fit [J].
Spiegelhalter, DJ ;
Best, NG ;
Carlin, BR ;
van der Linde, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 :583-616
[72]   Model-order selection [J].
Stoica, P ;
Selén, Y .
IEEE SIGNAL PROCESSING MAGAZINE, 2004, 21 (04) :36-47
[73]   ASYMPTOTIC EQUIVALENCE OF CHOICE OF MODEL BY CROSS-VALIDATION AND AKAIKES CRITERION [J].
STONE, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1977, 39 (01) :44-47
[74]  
Taylor J., 2014, POSTSELECTION ADAPTI
[76]   Catching up faster by switching sooner: a predictive approach to adaptive estimation with an application to the AIC-BIC dilemma [J].
van Erven, Tim ;
Grunwald, Peter ;
de Rooij, Steven .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 :361-417
[77]   AN INFORMATION MEASURE FOR CLASSIFICATION [J].
WALLACE, CS ;
BOULTON, DM .
COMPUTER JOURNAL, 1968, 11 (02) :185-+
[78]   ON PREDICTIVE LEAST-SQUARES PRINCIPLES [J].
WEI, CZ .
ANNALS OF STATISTICS, 1992, 20 (01) :1-42
[79]   The large-sample distribution of the likelihood ratio for testing composite hypotheses [J].
Wilks, SS .
ANNALS OF MATHEMATICAL STATISTICS, 1938, 9 :60-62
[80]   Accurate estimation of influenza epidemics using Google search data via ARGO [J].
Yang, Shihao ;
Santillana, Mauricio ;
Kou, S. C. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (47) :14473-14478