Sparse Estimation of Polynomial and Rational Dynamical Models

被引:31
作者
Rojas, Cristian R. [1 ,2 ]
Toth, Roland [3 ]
Hjalmarsson, Hakan [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Automat Control Lab, S-10044 Stockholm, Sweden
[2] KTH Royal Inst Technol, ACCESS Linnaeus Ctr, S-10044 Stockholm, Sweden
[3] Eindhoven Univ Technol, Dept Elect Engn, Control Syst Grp, NL-5600 MB Eindhoven, Netherlands
基金
欧洲研究理事会;
关键词
AIC; BIC; cross-validation; LASSO; model structure selection; sparse estimation; Steiglitz-McBride method; system identification; ORACLE PROPERTIES; REGRESSION; LASSO; SELECTION; IDENTIFICATION; SHRINKAGE;
D O I
10.1109/TAC.2014.2351711
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many practical situations, it is highly desirable to estimate an accurate mathematical model of a real system using as few parameters as possible. At the same time, the need for an accurate description of the system behavior without knowing its complete dynamical structure often leads to model parameterizations describing a rich set of possible hypotheses; an unavoidable choice, which suggests sparsity of the desired parameter estimate. An elegant way to impose this expectation of sparsity is to estimate the parameters by penalizing the criterion with the l(0) "norm" of the parameters. Due to the non-convex nature of the l(0)-norm, this penalization is often implemented as solving an optimization program based on a convex relaxation (e. g., l(1)/LASSO, nuclear norm, ...). Two difficulties arise when trying to apply these methods: (1) the need to use cross-validation or some related technique for choosing the values of regularization parameters associated with the l(1) penalty; and (2) the requirement that the (unpenalized) cost function must be convex. To address the first issue, we propose a new technique for sparse linear regression called SPARSEVA, with close ties with the LASSO (least absolute shrinkage and selection operator), which provides an automatic tuning of the amount of regularization. The second difficulty, which imposes a severe constraint on the types of model structures or estimation methods on which the l(1) relaxation can be applied, is addressed by combining SPARSEVA and the Steiglitz-McBride method. To demonstrate the advantages of the proposed approach, a solid theoretical analysis and an extensive simulation study are provided.
引用
收藏
页码:2962 / 2977
页数:16
相关论文
共 38 条
[1]  
[Anonymous], 2001, EXERCISES PROBABILIT
[2]  
[Anonymous], 1999, SYSTEM IDENTIFICATIO
[3]  
[Anonymous], 1995, Adaptive IIR Filtering in Signal Processing and Control
[4]  
Berger J.O., 1985, Statistical decision theory and Bayesian analysis, V2nd
[5]   BETTER SUBSET REGRESSION USING THE NONNEGATIVE GARROTE [J].
BREIMAN, L .
TECHNOMETRICS, 1995, 37 (04) :373-384
[6]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[7]   Stable signal recovery from incomplete and inaccurate measurements [J].
Candes, Emmanuel J. ;
Romberg, Justin K. ;
Tao, Terence .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2006, 59 (08) :1207-1223
[8]   A Bayesian approach to sparse dynamic network identification [J].
Chiuso, Alessandro ;
Pillonetto, Gianluigi .
AUTOMATICA, 2012, 48 (08) :1553-1565
[9]   IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE [J].
DONOHO, DL ;
JOHNSTONE, IM .
BIOMETRIKA, 1994, 81 (03) :425-455
[10]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499