Estimating population size and duplication rates when records cannot be linked

STATISTICS IN MEDICINE(2003)

引用 2|浏览9
暂无评分
摘要
The capture-recapture approach to estimating the size of a population is a well-studied area of statistics. The number of distinct individuals, N-A and N-B, On each of two lists, A and B, and the number common to both lists, N-AB, are used to form an estimate of the binomial probability of being on one of the lists, which then allows an estimate to be made of the size of the population. Critical to the method is an accurate count Of N-AB. We consider situations in which this count is not available. Such problems arise in a variety of behavioural health contexts in which the need for protection of privacy may prevent sharing identifying information, so it is not possible to specifically match an individual who appears on one list with an individual on the other. Suppose that the birth dates and/or other demographics of individuals on each list are known. We introduce two methods for estimating the duplication rates and the size of the population. Conditioning on the set beta of birth dates of those on list B, NA and Nil, the maximum likelihood estimators (MLEs) and their variance are derived. The MLEs are based on the proportion of individuals on list A whose birth dates fall in P. This approach is particularly useful if list B itself contains duplicates. The second model utilizes the full sample distribution of the birth dates. We generalize this approach to accommodate multiple demographic characteristics. The approaches are applied to the problem of estimating duplication rates and the population size of veterans who have mental illness in Kings County, NY. The data are lists of those receiving service from the Veterans Administration system and from providers funded or certified by the New York State Office of Mental Health. Copyright (C) 2003 John Wiley Sons, Ltd.
更多
查看译文
关键词
capture-recapture,mental health,record linkage,population size,duplication rates,birth dates
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要