A COVARIATE ADJUSTMENT FOR ZERO-TRUNCATED APPROACHES TO ESTIMATING THE SIZE OF HIDDEN AND ELUSIVE POPULATIONS

被引:29
作者
Boehning, Dankmar [1 ]
van der Heuden, Peter G. M. [2 ]
机构
[1] Univ Reading, Sch Biol Sci, Reading RG6 6BX, Berks, England
[2] Univ Utrecht, Dept Methodol & Stat, Fac Social & Behav Sci, NL-3508 TC Utrecht, Netherlands
关键词
Population size estimation; capture-recapture; estimation under model misspecification; truncated Poisson and binomial likelihood; elusive population; CAPTURE; NUMBER; DISTRIBUTIONS;
D O I
10.1214/08-AOAS214
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we consider the estimation of population size from one-source capture-recapture data, that is, a list in which individuals can potentially be found repeatedly and where the question is how many individuals are missed by the list. As a typical example, we provide data from a drug user study in Bangkok from 2001 where the list consists of drug users who repeatedly contact treatment institutions. Drug users with 1, 2, 3, ... contacts occur, but drug users with zero contacts are not present, requiring the size of this group to be estimated. Statistically, these data can be considered as stemming from a zero-truncated count distribution, We revisit an estimator for the population size suggested by Zelterman that is known to be robust under potential unobserved heterogeneity. We demonstrate that the Zelterman estimator can be viewed as a maximum likelihood estimator for a locally truncated Poisson likelihood which is equivalent to a binomial likelihood. This result allows the extension of the Zelterman estimator by means of logistic regression to include observed heterogeneity in the form of covariates. We also review ail estimator proposed by Chao and explain why we are not able to obtain similar results for this estimator. The Zelterman estimator is applied in two case studies, the first a drug user study from Bangkok, the second an illegal immigrant study in the Netherlands. Our results suggest the new estimator should be used, in particular, if Substantial unobserved heterogeneity is present.
引用
收藏
页码:595 / 610
页数:16
相关论文
共 23 条
[1]  
[Anonymous], STAT NEERLANDICA
[2]  
Bishop M.M., 1975, DISCRETE MULTIVARIAT
[3]   Nonparametric maximum likelihood estimation of population size based on the counting distribution [J].
Böhning, D ;
Schön, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :721-737
[4]   Estimating the number of drug users in Bangkok 2001:: A capture-recapture approach using repeated entries in one list [J].
Böhning, D ;
Suppawattanabodee, B ;
Kusolvisitkul, W ;
Viwatwongkasem, C .
EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2004, 19 (12) :1075-1083
[5]  
BOHNING D, 2008, COVARIATE ADJUSTME S, DOI DOI 10.1214/08-AOAS214SUPP
[6]   Equivalence of truncated count mixture distributions and mixtures of truncated count distributions [J].
Bohning, Dankmar ;
Kuhnert, Ronny .
BIOMETRICS, 2006, 62 (04) :1207-1215
[7]   ESTIMATING POPULATION-SIZE FOR SPARSE DATA IN CAPTURE RECAPTURE EXPERIMENTS [J].
CHAO, A .
BIOMETRICS, 1989, 45 (02) :427-438
[8]   ESTIMATING THE POPULATION-SIZE FOR CAPTURE RECAPTURE DATA WITH UNEQUAL CATCHABILITY [J].
CHAO, A .
BIOMETRICS, 1987, 43 (04) :783-791
[10]   Estimating the number of drug injectors from needle exchange data [J].
Hay, G ;
Smit, F .
ADDICTION RESEARCH & THEORY, 2003, 11 (04) :235-243