Lasso-constrained regression analysis for interval-valued data

Cited by: 34
Author
Giordani, Paolo [1]
Affiliation
[1] Univ Roma La Sapienza, Dept Stat Sci, I-00185 Rome, Italy
Keywords
Interval-valued data; Regression; Lasso; Prediction accuracy; MODEL; COMPACT; SETS;
DOI
10.1007/s11634-014-0164-8
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract
A new method of regression analysis for interval-valued data is proposed. The relationship between an interval-valued response variable and a set of interval-valued explanatory variables is investigated by considering two regression models, one for the midpoints and the other one for the radii. The estimation problem is approached by introducing Lasso-based constraints on the regression coefficients. This can improve the prediction accuracy of the model and, taking into account the nature of the constraints, can sometimes produce a parsimonious model with a common subset of regression coefficients for the midpoint and the radius models. The effectiveness of our method, called Lasso-IR (Lasso-based Interval-valued Regression), is shown by a simulation experiment and some applications to real data.
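The midpoint/radius decomposition described in the abstract can be illustrated with a minimal sketch. It is a simplified stand-in, not the paper's Lasso-IR estimator: it fits two independent Lasso regressions (one on midpoints, one on radii) by plain coordinate descent, without an intercept and without the joint constraints that tie the two models together in the paper. The function names `to_mid_rad` and `lasso_cd`, the penalty value, and the toy data are all assumptions made for illustration.

```python
def to_mid_rad(intervals):
    """Split (lower, upper) intervals into lists of midpoints and radii."""
    mids = [(lo + hi) / 2.0 for lo, hi in intervals]
    rads = [(hi - lo) / 2.0 for lo, hi in intervals]
    return mids, rads


def soft_threshold(z, lam):
    """Soft-thresholding operator used by the Lasso coordinate update."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X, y, lam, n_iter=200):
    """Coordinate descent for (1/(2n)) * ||y - X b||^2 + lam * ||b||_1.

    X is a list of rows, y a list of responses; no intercept is fitted.
    """
    n, p = len(X), len(X[0])
    b = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # covariance of column j with the partial residual
            zj = 0.0   # squared norm of column j
            for i in range(n):
                pred_wo_j = sum(X[i][k] * b[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - pred_wo_j)
                zj += X[i][j] ** 2
            b[j] = soft_threshold(rho / n, lam) / (zj / n) if zj > 0 else 0.0
    return b


# Toy interval-valued data (hypothetical): two explanatory variables, one response.
x1 = [(1.0, 3.0), (2.0, 5.0), (4.0, 6.0), (5.0, 9.0)]
x2 = [(0.0, 1.0), (1.0, 4.0), (2.0, 3.0), (3.0, 8.0)]
y = [(2.0, 5.0), (4.0, 9.0), (7.0, 10.0), (9.0, 16.0)]

x1_mid, x1_rad = to_mid_rad(x1)
x2_mid, x2_rad = to_mid_rad(x2)
y_mid, y_rad = to_mid_rad(y)

# Design matrices for the midpoint model and the radius model.
X_mid = [list(row) for row in zip(x1_mid, x2_mid)]
X_rad = [list(row) for row in zip(x1_rad, x2_rad)]

lam = 0.1  # illustrative penalty; in practice it would be tuned, e.g. by cross-validation
b_mid = lasso_cd(X_mid, y_mid, lam)
b_rad = lasso_cd(X_rad, y_rad, lam)
print("midpoint coefficients:", b_mid)
print("radius coefficients:  ", b_rad)
```

Because the two fits here are independent, nothing encourages the midpoint and radius models to share coefficients, and predicted radii are not guaranteed to be non-negative. According to the abstract, the constraints used in Lasso-IR are what can produce a common subset of coefficients across the two models.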
Pages: 5-19
Number of pages: 15