On the implementation of LIR: the case of simple linear regression with interval data

被引:1
作者
Cattaneo, Marco E. G. V. [1 ]
Wiencierz, Andrea [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Stat, D-80539 Munich, Germany
关键词
Interval-censored data; Nonparametric likelihood inference; Robust regression; Least median of squares; Exact algorithm; R package; LEAST QUANTILE; SQUARES; ALGORITHMS; INCOME;
D O I
10.1007/s00180-013-0459-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper considers the problem of simple linear regression with interval-censored data. That is, pairs of intervals are observed instead of the pairs of precise values for the two variables (dependent and independent). Each of these intervals is closed but possibly unbounded, and contains the corresponding (unobserved) value of the dependent or independent variable. The goal of the regression is to describe the relationship between (the precise values of) these two variables by means of a linear function. Likelihood-based Imprecise Regression (LIR) is a recently introduced, very general approach to regression for imprecisely observed quantities. The result of a LIR analysis is in general set-valued: it consists of all regression functions that cannot be excluded on the basis of likelihood inference. These regression functions are said to be undominated. Since the interval data can be unbounded, a robust regression method is necessary. Hence, we consider the robust LIR method based on the minimization of the residuals' quantiles. For this method, we prove that the set of all the intercept-slope pairs corresponding to the undominated regression functions is the union of finitely many polygons. We give an exact algorithm for determining this set (i.e., for determining the set-valued result of the robust LIR analysis), and show that it has worst-case time complexity . We have implemented this exact algorithm as part of the R package linLIR.
引用
收藏
页码:743 / 767
页数:25
相关论文
共 46 条
[1]  
Alexandrov Alexander D., 2005, Convex polyhedra
[2]  
[Anonymous], 2012, R LANG ENV STAT COMP
[3]  
[Anonymous], 2007, SAND20070939 SAND NA
[4]  
[Anonymous], 2009, Wiley Series in Probability and Statistics, DOI DOI 10.1002/9780470434697.CH7
[5]  
[Anonymous], 1998, SORTING SEARCHING
[6]   ACCEPTABILITY OF REGRESSION SOLUTIONS - ANOTHER LOOK AT COMPUTATIONAL ACCURACY [J].
BEATON, AE ;
RUBIN, DB ;
BARONE, JL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (353) :158-168
[7]  
BUCKLEY J, 1979, BIOMETRIKA, V66, P429
[8]   THE DETERMINATION OF A LEAST QUANTILE OF SQUARES REGRESSION LINE FOR ALL QUANTILES [J].
CARRIZOSA, E ;
PLASTRIA, F .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1995, 20 (05) :467-479
[9]   Likelihood-based Imprecise Regression [J].
Cattaneo, Marco E. G. V. ;
Wiencierz, Andrea .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2012, 53 (08) :1137-1154
[10]   A review on empirical likelihood methods for regression [J].
Chen, Song Xi ;
Van Keilegom, Ingrid .
TEST, 2009, 18 (03) :415-447