Beyond Crosswalks: Reliability of Exposure Assessment Following Automated Coding of Free-Text Job Descriptions for Occupational Epidemiology

Cited by: 27
Authors
Burstyn, Igor [1]
Slutsky, Anton [2]
Lee, Derrick G. [3]
Singer, Alison B. [4]
An, Yuan [2]
Michael, Yvonne L. [5]
Affiliations
[1] Drexel Univ, Sch Publ Hlth, Dept Environm & Occupat Hlth, Philadelphia, PA 19104 USA
[2] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA
[3] Univ British Columbia, Sch Populat & Publ Hlth, Vancouver, BC V5Z 1M9, Canada
[4] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Epidemiol, Baltimore, MD USA
[5] Drexel Univ, Dept Epidemiol & Biostat, Sch Publ Hlth, Philadelphia, PA 19104 USA
Funding
National Science Foundation (USA)
Keywords
allergens; coal tar pitch volatiles; epidemiology methodology; ergonomics; exposure assessment methodology; job-exposure matrix; MATRIX; ASTHMA
DOI
10.1093/annhyg/meu006
Chinese Library Classification number
R1 [Preventive Medicine, Hygiene]
Discipline classification code
1004; 120402
Abstract
Epidemiologists typically collect narrative descriptions of occupational histories because these are less prone than self-reported exposures to recall bias of exposure to a specific hazard. However, the task of coding these narratives can be daunting and prohibitively time-consuming in some settings. The aim of this manuscript is to evaluate the performance of a computer algorithm to translate the narrative description of occupational codes into standard classification of jobs (2010 Standard Occupational Classification) in an epidemiological context. The fundamental question we address is whether exposure assignment resulting from manual (presumed gold standard) coding of the narratives is materially different from that arising from the application of automated coding. We pursued our work through three motivating examples: assessment of physical demands in the Women's Health Initiative observational study, evaluation of predictors of exposure to coal tar pitch volatiles in the US Occupational Safety and Health Administration's (OSHA) Integrated Management Information System, and assessment of exposure to agents known to cause occupational asthma in a pregnancy cohort. In these diverse settings, we demonstrate that automated coding of occupations results in assignment of exposures that are in reasonable agreement with results that can be obtained through manual coding. The correlation between physical demand scores based on manual and automated job classification schemes was reasonable (r = 0.5). The agreement between predictive probability of exceeding OSHA's permissible exposure level for polycyclic aromatic hydrocarbons, using coal tar pitch volatiles as a surrogate, based on manual and automated coding of jobs was modest (Kendall rank correlation = 0.29). In the case of binary assignment of exposure to asthmagens, we observed that fair to excellent agreement in classifications can be reached, depending on presence of ambiguity in assigned job classification (κ = 0.5–0.8).
Thus, the success of automated coding appears to depend on the setting and type of exposure that is being assessed. Our overall recommendation is that automated translation of short narrative descriptions of jobs for exposure assessment is feasible in some settings and essential for large cohorts, especially if combined with manual coding to both assess reliability of coding and to further refine the coding algorithm.
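The abstract reports three kinds of agreement between manual and automated coding: a correlation coefficient for continuous scores, a Kendall rank correlation for predicted probabilities, and an agreement statistic for binary exposure assignment (read here as Cohen's kappa). The following is a minimal Python sketch of how such statistics can be computed. It is not the authors' code. The function names and all data values are hypothetical illustrations, and the Kendall variant shown is tau-a, which may differ from the variant used in the paper.

```python
from collections import Counter
from math import sqrt


def cohen_kappa(a, b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(a)
    # Observed proportion of records on which the two codings agree.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance from each coder's marginal frequencies.
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both codings use one identical label throughout
    return (observed - expected) / (1.0 - expected)


def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    return sxy / sqrt(sxx * syy)


def kendall_tau_a(x, y):
    """Kendall tau-a: (concordant - discordant) pairs over all pairs.

    Tied pairs count as neither concordant nor discordant.
    """
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            s += (prod > 0) - (prod < 0)
    return s / (n * (n - 1) / 2)


# Hypothetical data: 1 = exposed to an asthmagen, 0 = unexposed,
# as assigned after manual and after automated job coding.
manual_exposed = [1, 1, 0, 0, 1, 0, 0, 1, 0, 0]
auto_exposed = [1, 0, 0, 0, 1, 0, 1, 1, 0, 0]
kappa = cohen_kappa(manual_exposed, auto_exposed)

# Hypothetical continuous scores derived from each coding.
manual_score = [1.0, 2.0, 3.0, 4.0]
auto_score = [1.0, 3.0, 2.0, 4.0]
r = pearson_r(manual_score, auto_score)
tau = kendall_tau_a(manual_score, auto_score)
```

On the hypothetical binary data the two codings agree on 8 of 10 records, and chance agreement from the marginals is 0.52, so kappa comes out near 0.58.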
Pages: 482-492
Page count: 11