Estimating Population Distributions When Some Data Are Below a Limit of Detection by Using a Reverse Kaplan-Meier Estimator

被引:81
作者
Gillespie, Brenda W. [1 ,2 ]
Chen, Qixuan [2 ,3 ]
Reichert, Heidi [1 ]
Franzblau, Alfred [4 ]
Hedgeman, Elizabeth [4 ]
Lepkowski, James [5 ]
Adriaens, Peter [6 ]
Demond, Avery [6 ]
Luksemburg, William [7 ]
Garabrant, David H. [4 ]
机构
[1] Univ Michigan, Ctr Stat Consultat & Res, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Sch Publ Hlth, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Columbia Univ, Dept Biostat, Mailman Sch Publ Hlth, New York, NY USA
[4] Univ Michigan, Dept Environm Hlth Sci, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Inst Social Res, Ann Arbor, MI 48109 USA
[6] Univ Michigan, Dept Civil & Environm Engn, Coll Engn, Ann Arbor, MI 48109 USA
[7] Vista Analyt Lab, El Dorado Hills, CA USA
关键词
DOUBLY CENSORED-DATA; POLYCHLORINATED-BIPHENYLS; DIOXINS; PCDFS; PCDDS; SERUM; PCBS;
D O I
10.1097/EDE.0b013e3181ce9f08
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Data with some values below a limit of detection (LOD) can be analyzed using methods of survival analysis for left-censored data. The reverse Kaplan-Meier (KM) estimator provides an effective method for estimating the distribution function and thus population percentiles for such data. Although developed in the 1970s and strongly advocated since then, it remains rarely used, partly due to limited software availability. Methods: In this paper, the reverse KM estimator is described and is illustrated using serum dioxin data from the University of Michigan Dioxin Exposure Study (UMDES) and the National Health and Nutrition Examination Survey (NHANES). Percentile estimates for left-censored data using the reverse KM estimator are compared with replacing values below the LOD with the LOD/2 or LOD/root 2. Results: When some LODs are in the upper range of the complete values, and/or the percent censored is high, the different methods can yield quite different percentile estimates. The reverse KM estimator, which is the nonparametric maximum likelihood estimator, is the preferred method. Software options are discussed: The reverse KM can be calculated using software for the KM estimator. The JMP and SAS (SAS Institute, Cary, NC) and Minitab (Minitab, Inc, State College, PA), software packages calculate the reverse KM directly using their Turnbull estimator routines. Conclusion: The reverse KM estimator is recommended for estimation of the distribution function and population percentiles in preference to commonly used methods such as substituting LOD/2 or LOD/root 2 for values below the LOD, assuming a known parametric distribution, or using imputation to replace the left-censored values.
引用
收藏
页码:S64 / S70
页数:7
相关论文
共 25 条
[1]   Evaluation of statistical treatments of left-censored environmental data using coincident uncensored data sets: I. Summary statistics [J].
Antweiler, Ronald C. ;
Taylor, Howard E. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2008, 42 (10) :3732-3738
[2]   The Mean, Median, and Confidence Intervals of the Kaplan-Meier Survival Estimate-Computations and Applications [J].
Barker, Chris .
AMERICAN STATISTICIAN, 2009, 63 (01) :78-80
[3]   Percentile estimation using variable censored data [J].
Caudill, Samuel P. ;
Wong, Lee-Yang ;
Turner, Wayman E. ;
Lee, Robin ;
Henderson, Alden ;
Patterson, Donald G., Jr. .
CHEMOSPHERE, 2007, 68 (01) :169-180
[4]  
Efron B., 1967, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 4: Biology and Problems of Health, P831
[5]  
Giolo S.R., 2004, TURNBULLS NONPARAMET
[6]  
Greenwood M., 1926, The natural duration of cancer. Reports on Public Health and Medical Subjects, P1
[7]   ASYMPTOTIC PROPERTIES OF SELF-CONSISTENT ESTIMATORS BASED ON DOUBLY CENSORED-DATA [J].
GU, MG ;
ZHANG, CH .
ANNALS OF STATISTICS, 1993, 21 (02) :611-624
[8]   The University of Michigan Dioxin Exposure Study: Population Survey Results and Serum Concentrations for Polychlorinated Dioxins, Furans, and Biphenyls [J].
Hedgeman, Elizabeth ;
Chen, Qixuan ;
Hong, Biling ;
Chang, Chiung-Wen ;
Olson, Kristen ;
LaDronka, Kathleen ;
Ward, Barbara ;
Adriaens, Peter ;
Demond, Avery ;
Gillespie, Brenda W. ;
Lepkowski, James ;
Franzblau, Alfred ;
Garabrant, David H. .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2009, 117 (05) :811-817
[9]  
Heiberger RM, 2009, R EXCEL
[10]  
Helsel D. R, 2005, Nondetects and Data Analysis. Statistics for Censored Environmental Data