Beat the Heap: An Imputation Strategy for Valid Inferences from Rounded Income Data

被引:11
作者
Drechsler, Joerg [1 ]
Kiesl, Hans [2 ]
机构
[1] Inst Employment Res, Regensburger Str 104, D-90478 Nurnberg, Germany
[2] OTH Regensburg, Dept Comp Sci & Math, Regensburg, Germany
关键词
Heaping; Measurement error; Multiple imputation; Poverty rate;
D O I
10.1093/jssam/smv032
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Questions on income in surveys are prone to two sources of errors that can cause bias if not addressed adequately at the analysis stage. On the one hand, income is considered sensitive information, and response rates on income questions generally tend to be lower than response rates for other nonsensitive questions. On the other hand, respondents usually do not remember their exact income and thus tend to provide a rounded estimate. The negative effects of item nonresponse are well studied, and most statistical agencies have developed sophisticated imputation methods to correct for this potential source of bias. However, to our knowledge, the effects of rounding are hardly ever considered in practice, despite the fact that several studies have found strong evidence that most of the respondents round their reported income values. In this article, we illustrate the substantial impact that rounding can have on important measures derived from the income variable, such as the poverty rate. To obtain unbiased estimates, we propose a two-stage imputation strategy that estimates the posterior probability for rounding given the observed income values at the first stage and reimputes the observed income values given the rounding probabilities at the second stage. A simulation study shows that the proposed imputation model can help overcome the possible negative effects of rounding. We also present results based on the household income variable from the German panel study Labour Market and Social Security.
引用
收藏
页码:22 / 42
页数:21
相关论文
共 31 条
[1]  
[Anonymous], 2021, Bayesian data analysis
[2]   IS EARNINGS NONRESPONSE IGNORABLE? [J].
Bollinger, Christopher R. ;
Hirsch, Barry T. .
REVIEW OF ECONOMICS AND STATISTICS, 2013, 95 (02) :407-416
[3]  
Clementi F, 2005, NEW ECON WINDOWS, P3, DOI 10.1007/88-470-0389-X_1
[4]  
Czajka JohnL., 2008, INCOME DATA POLICY A
[5]   Reliable inference for the Gini index [J].
Davidson, Russell .
JOURNAL OF ECONOMETRICS, 2009, 150 (01) :30-40
[6]  
DEMPSTER AP, 1983, J ROY STAT SOC B MET, V45, P51
[7]   MI Double Feature: Multiple Imputation to Address Nonresponse and Rounding Errors in Income Questions [J].
Drechsler, Joerg ;
Kies, Hans ;
Speidel, Matthias .
AUSTRIAN JOURNAL OF STATISTICS, 2015, 44 (02) :59-71
[8]  
Eurostat, 2015, GLOSS EQ DISP INC ST
[9]   Multiple imputation for model checking: Completed-data plots with missing and latent data [J].
Gelman, A ;
Van Mechelen, I ;
Verbeke, G ;
Heitjan, DF ;
Meulders, M .
BIOMETRICS, 2005, 61 (01) :74-85
[10]  
Haughton J., 2009, HDB POVERTY INEQUALI, DOI DOI 10.1596/978-0-8213-7613-3