A bagging-based correction for the mixture model estimator of population size.
BIOMETRICAL JOURNAL(2008)
摘要
Estimation of a population size by means of capture-recapture techniques is all important problem occurring in many areas of life and social sciences. We consider the frequencies of frequencies situation, where a count variable is used to summarize how often a unit has been identified in the target population of interest. The distribution of this count variable is zero-truncated since zero identifications do not occur in the sample. As all application we consider the surveillance of scrapie ill Great Britain. In this case study holdings with scrapie that are not identified (zero counts) do not enter the surveillance database. The count variable of interest is the number of scrapie cases per holding. For count distributions a common model is the Poisson distribution and, to adjust for potential heterogeneity, a discrete mixture of Poisson distributions is used. Mixtures of Poissons Usually provide an excellent fit as will be demonstrated in the application of interest. However, as it has been recently demonstrated, mixtures also suffer under the so-called boundary problem, resulting in overestimation of population size. It is suggested here to select the Mixture model oil the basis of the Bayesian Information Criterion. This strategy is further refined by employing a bagging procedure leading to a series of estimates of population size. Using the median of this series, highly influential size estimates are avoided. In limited simulation studies it is shown that the procedure leads to estimates with remarkable small bias.
更多查看译文
关键词
Bagging,Bootstrap,Boundary Problem,Nonparametric Mixture Model,Population Size Estimator,Zero-truncation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络