INTEGRATING MULTIPLE BUILT ENVIRONMENT DATA SOURCES

被引:0
作者
Won, Jung Yeon [1 ]
Elliott, Michael R. [1 ]
Sanchez-Vaznaugh, Emma V. [2 ]
Sanchez, Brisa N. [3 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] San Francisco State Univ, Dept Hlth Educ, San Francisco, CA 94132 USA
[3] Drexel Univ, Dept Epidemiol & Biostat, Philadelphia, PA 19104 USA
关键词
Built-environment; count exposure; data integration; measurement error; Dirichlet process mixture model; commercial business lists; BODY-MASS INDEX; MEASUREMENT ERROR; BAYESIAN-APPROACH; POPULATION-SIZE; FOOD; MODELS; MIXTURES; CHILDREN; POISSON; NUMBER;
D O I
10.1214/22-AOAS1692
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Studies examining the contribution of the built environment to health often rely on commercial data sources to derive exposure measures, such as the number of specific food outlets in study participants' neighborhoods. Data on the location of community amenities (e.g., food outlets) can be col-lected from multiple sources. However, these commercial listings are known to have ascertainment errors and thus provide conflicting claims about the number and location of amenities. We propose a method that integrates expo-sure measures from different databases, while accounting for ascertainment errors, and obtains unbiased health effects of latent exposure. We frame the problem of conflicting exposure measures as a problem of two contingency tables with partially known margins, with the entries of the tables modeled using a multinomial distribution. Available estimates of source quality were embedded in a joint model for observed exposure counts, latent exposures, and health outcomes. Simulations show that our modeling framework yields substantially improved inferences regarding the health effects. We used the proposed method to estimate the association between children's body mass index (BMI) and the concentration of food outlets near their schools when both the NETS and Reference USA databases are available.
引用
收藏
页码:1722 / 1739
页数:18
相关论文
共 42 条
[1]  
Aldous David J., 1985, Lecture Notes in Math., P1, DOI [DOI 10.1007/BFB0099421, 10.1007/BFb0099421]
[2]   Proximity to Fast-Food Outlets and Supermarkets as Predictors of Fast-Food Dining Frequency [J].
Athens, Jessica K. ;
Duncan, Dustin T. ;
Elbel, Brian .
JOURNAL OF THE ACADEMY OF NUTRITION AND DIETETICS, 2016, 116 (08) :1266-1275
[3]   Discrete Latent Variable Models [J].
Bartolucci, Francesco ;
Pandolfi, Silvia ;
Pennoni, Fulvia .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2022, 9 :425-452
[4]  
CALIFORNIA DEPARTMENT OF EDUCATION, 2019, PHYS FITN TEST PFT
[5]   Modified ground-truthing: an accurate and cost-effective food environment validation method for town and rural areas [J].
Caspi, Caitlin Eicher ;
Friebur, Robin .
INTERNATIONAL JOURNAL OF BEHAVIORAL NUTRITION AND PHYSICAL ACTIVITY, 2016, 13
[6]  
Dong XL, 2009, PROC VLDB ENDOW, V2
[7]   Modeling unobserved sources of heterogeneity in animal abundance using a Dirichlet process prior [J].
Dorazio, Robert M. ;
Mukherjee, Bhramar ;
Zhang, Li ;
Ghosh, Malay ;
Jelks, Howard L. ;
Jordan, Frank .
BIOMETRICS, 2008, 64 (02) :635-644
[8]   BAYESIAN ANALYSIS OF SOME NONPARAMETRIC PROBLEMS [J].
FERGUSON, TS .
ANNALS OF STATISTICS, 1973, 1 (02) :209-230
[9]   ANALYSIS OF INCOMPLETE MULTI-WAY CONTINGENCY TABLES [J].
FIENBERG, SE .
BIOMETRICS, 1972, 28 (01) :177-&
[10]   MULTIPLE RECAPTURE CENSUS FOR CLOSED POPULATIONS AND INCOMPLETE 2K CONTINGENCY TABLES [J].
FIENBERG, SE .
BIOMETRIKA, 1972, 59 (03) :591-603