Lasso-constrained regression analysis for interval-valued data

被引:0
作者
Paolo Giordani
机构
[1] Sapienza University of Rome,Department of Statistical Sciences
来源
Advances in Data Analysis and Classification | 2015年 / 9卷
关键词
Interval-valued data; Regression; Lasso; Prediction accuracy; MSC 62J05;
D O I
暂无
中图分类号
学科分类号
摘要
A new method of regression analysis for interval-valued data is proposed. The relationship between an interval-valued response variable and a set of interval-valued explanatory variables is investigated by considering two regression models, one for the midpoints and the other one for the radii. The estimation problem is approached by introducing Lasso-based constraints on the regression coefficients. This can improve the prediction accuracy of the model and, taking into account the nature of the constraints, can sometimes produce a parsimonious model with a common subset of regression coefficients for the midpoint and the radius models. The effectiveness of our method, called Lasso-IR (Lasso-based Interval-valued Regression), is shown by a simulation experiment and some applications to real data.
引用
收藏
页码:5 / 19
页数:14
相关论文
共 34 条
[1]  
Ahn J(2012)A resampling approach for interval-valued data regression Stat Anal Data Min 5 336-348
[2]  
Peng M(2003)From the statistics of data to the statistics of knowledge: symbolic data analysis J Am Stat Assoc 98 470-487
[3]  
Park C(2011)Estimation of a flexible simple linear model for interval data based on set arithmetic Comput Stat Data Anal 55 2568-2578
[4]  
Jeon Y(2012)Confidence sets in a linear regression model for interval data J Stat Plann Infer 142 1320-1329
[5]  
Billard L(2011)Far beyond the classical data models: symbolic data analysis Stat Anal Data Min 4 157-170
[6]  
Diday E(2010)A robust method for linear regression of symbolic interval data Patt Rec Lett 31 1991-1996
[7]  
Blanco-Fernandez A(2004)Least angle regression Ann Stat 32 407-499
[8]  
Corral N(2007)Least squares estimation of linear regression models for convex compact random sets Adv Data Anal Classif 1 67-81
[9]  
Gonzalez-Rodriguez G(2008)Centre and range method to fitting a linear regression model on symbolic interval data Comput Stat Data Anal 52 1500-1515
[10]  
Blanco-Fernandez A(2010)Constrained linear regression models for symbolic interval-valued variables Comput Stat Data Anal 54 333-347