On the use of ZIP codes and ZIP code tabulation areas (ZCTAs) for the spatial analysis of epidemiological data

Cited by: 200
Authors
Grubesic T.H. [1]
Matisziw T.C. [2]
Affiliations
[1] Department of Geography, Indiana University, Bloomington
[2] Center for Urban and Regional Analysis, The Ohio State University, Columbus
Keywords
Street Segment; Census Block; Areal Unit; Spatial Weight Matrix; Spatial Mismatch
DOI
10.1186/1476-072X-5-58
Abstract
Background: While the use of spatially referenced data for the analysis of epidemiological data is growing, issues associated with selecting the appropriate geographic unit of analysis are also emerging. A particularly problematic unit is the ZIP code. Because ZIP codes lack standardization and are highly dynamic in structure, the use of ZIP codes and ZIP code tabulation areas (ZCTAs) for the spatial analysis of disease presents a unique challenge to researchers. Problems associated with these units for detecting spatial patterns of disease are explored.
Results: A brief review of ZIP codes and their spatial representation is conducted. Though frequently represented as polygons to facilitate analysis, ZIP codes are actually defined at a narrower spatial resolution reflecting the street addresses they serve. This research shows that their generalization as continuous regions is an imposed structure that can have serious implications for the interpretation of research results. ZIP code areas and Census-defined ZCTAs, two commonly used polygonal representations of ZIP code address ranges, are examined in an effort to identify the spatial statistical sensitivities that emerge given differences in how these representations are defined. Here, comparative analysis focuses on the detection of patterns of prostate cancer in New York State. Of particular interest for studies utilizing local spatial statistical tests is that differences in the topological structures of ZIP code areas and ZCTAs give rise to different spatial patterns of disease. These differences are related to the different methodologies used in the generalization of ZIP code information. Given the difficulty associated with generating ZIP code boundaries, both ZIP code areas and ZCTAs contain numerous representational errors that can have a significant impact on spatial analysis. While the use of ZIP code polygons for spatial analysis is relatively straightforward, ZCTA representations contain additional topological features (e.g., lakes and rivers) and fragmented polygons that can hinder spatial analysis.
Conclusion: Caution must be exercised when using spatially referenced data, particularly data attributed to ZIP codes and ZCTAs, for epidemiological analysis. Researchers should be cognizant of representational errors associated with both geographies and their resulting spatial mismatch, especially when comparing the results obtained using different topological representations. While ZCTAs can be problematic, topological corrections are easily implemented in a geographic information system to remedy erroneous aggregation effects. © 2006 Grubesic and Matisziw; licensee BioMed Central Ltd.