Predictive Mean Matching as an alternative imputation method to hot deck in Vigitel

Cited by: 3
Authors
Santana dos Santos, Iolanda Karla [1, 2]
Conde, Wolney Lisboa [1]
Affiliations
[1] Univ Sao Paulo, Fac Saude Publ, Av Dr Arnaldo 715, BR-02361100 Sao Paulo, SP, Brazil
[2] Fundacao Univ Fed ABC, Santo Andre, SP, Brazil
Source
CADERNOS DE SAUDE PUBLICA | 2020 / Vol. 36 / No. 06
Keywords
Nutrition Surveys; Surveillance; Nutritional Status; Epidemiology; Multiple Imputation
DOI
10.1590/0102-311X00167219
Chinese Library Classification (CLC)
R1 [Preventive Medicine, Hygiene]
Subject classification codes
1004; 120402
Abstract
This study aimed to describe the estimated means for weight, height, and body mass index (BMI) under two imputation methods, using data from Vigitel (Risk and Protective Factors Surveillance System for Chronic Non-Communicable Diseases Through Telephone Interview). This was a cross-sectional study that used secondary data from the Vigitel survey from 2006 to 2017. The two imputation methods were hot deck and Predictive Mean Matching (PMM). The weight and height variables imputed by hot deck were provided by Vigitel. Two PMM models were fitted, with weight and height as the outcome variables in both: (i) explanatory variables city, sex, age in years, race/color, and schooling; (ii) explanatory variables city, sex, and age in years. PMM combines linear regression with random selection of the value for imputation. The linear prediction serves as the measure of distance between the case with the missing value and the possible donors, which defines a pool of candidate cases that can supply the value for imputation. One candidate from the pool is randomly selected, and its observed value is assigned to the missing unit. BMI was calculated by dividing weight in kilograms by the square of height in meters. The results report the means and standard deviations for weight, height, and BMI by imputation method and year. The estimates were obtained with the survey module in Stata, which accounts for the sampling design. The mean values for weight, height, and BMI estimated by hot deck and PMM were similar. The results with the Vigitel data suggest that PMM is applicable to health surveys.
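The PMM procedure described in the abstract (linear regression, distance measured on the linear prediction, a pool of candidate donors, random selection of one donor) can be illustrated with a minimal Python sketch. This is not the authors' implementation, which used Stata. The function names `pmm_impute` and `bmi`, the pool size `k=5`, and the synthetic data are assumptions made only for illustration. The sketch also omits the step in which full PMM implementations draw the regression coefficients from their posterior distribution before matching.

```python
import numpy as np


def pmm_impute(X, y, k=5, rng=None):
    """Fill NaNs in y by a simplified predictive mean matching.

    X: (n, p) array of fully observed explanatory variables.
    y: (n,) array with np.nan marking missing values.
    k: size of the donor pool (hypothetical default, not from the source).
    """
    rng = np.random.default_rng(rng)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    missing = np.isnan(y)

    # Linear regression fitted on the observed cases only.
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A[~missing], y[~missing], rcond=None)

    # Linear prediction for every case, observed or missing.
    pred = A @ beta
    donor_pred = pred[~missing]
    donor_y = y[~missing]

    out = y.copy()
    for i in np.flatnonzero(missing):
        # Distance between the missing case and each possible donor.
        dist = np.abs(donor_pred - pred[i])
        # Pool of the k closest donors; one is picked at random.
        pool = np.argsort(dist)[:k]
        out[i] = donor_y[rng.choice(pool)]
    return out


def bmi(weight_kg, height_m):
    """Body mass index: weight in kilograms over height in meters squared."""
    return weight_kg / height_m ** 2


# Synthetic data for illustration only (not Vigitel data).
gen = np.random.default_rng(0)
n = 200
age = gen.uniform(18, 80, n)
sex = gen.integers(0, 2, n)
weight = 60 + 10 * sex + 0.1 * age + gen.normal(0, 8, n)
weight_obs = weight.copy()
weight_obs[gen.random(n) < 0.1] = np.nan  # about 10% set to missing

weight_imp = pmm_impute(np.column_stack([age, sex]), weight_obs, k=5, rng=1)
```

Because every imputed value is copied from an observed donor, the imputed weights stay within the range of values actually reported, which is one reason PMM is often compared with hot deck.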
Pages: 8