Prevalence estimation subject to misclassification: The mis-substitution bias and some remedies

Zhiwei Zhang, Chunling Liu, Sungduk Kim, Aiyi Liu

Research output: Journal article publicationJournal articleAcademic researchpeer-review

13 Citations (Scopus)

Abstract

We consider the problem of estimating the prevalence of a disease under a group testing framework. Because assays are usually imperfect, misclassification of disease status is a major challenge in prevalence estimation. To account for possible misclassification, it is usually assumed that the sensitivity and specificity of the assay are known and independent of the group size. This assumption is often questionable, and substitution of incorrect values of an assay's sensitivity and specificity can result in a large bias in the prevalence estimate, which we refer to as the mis-substitution bias. In this article, we propose simple designs and methods for prevalence estimation that do not require known values of assay sensitivity and specificity. If a gold standard test is available, it can be applied to a validation subsample to yield information on the imperfect assay's sensitivity and specificity. When a gold standard is unavailable, it is possible to estimate assay sensitivity and specificity, either as unknown constants or as specified functions of the group size, from group testing data with varying group size. We develop methods for estimating parameters and for finding or approximating optimal designs, and perform extensive simulation experiments to evaluate and compare the different designs. An example concerning human immunodeficiency virus infection is used to illustrate the validation subsample design.
Original languageEnglish
Pages (from-to)4482-4500
Number of pages19
JournalStatistics in Medicine
Volume33
Issue number25
DOIs
Publication statusPublished - 1 Jan 2014

Keywords

  • Dilution effect
  • Group testing
  • Maximum likelihood
  • Optimal design
  • Pooled testing
  • Sensitivity
  • Specificity
  • Test error

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability

Cite this