PLoS Computational Biology (Jul 2021)

Sample size calculation for phylogenetic case linkage.

  • Shirlee Wohl,
  • John R Giles,
  • Justin Lessler

DOI
https://doi.org/10.1371/journal.pcbi.1009182
Journal volume & issue
Vol. 17, no. 7
p. e1009182

Abstract

Read online

Sample size calculations are an essential component of the design and evaluation of scientific studies. However, there is a lack of clear guidance for determining the sample size needed for phylogenetic studies, which are becoming an essential part of studying pathogen transmission. We introduce a statistical framework for determining the number of true infector-infectee transmission pairs identified by a phylogenetic study, given the size and population coverage of that study. We then show how characteristics of the criteria used to determine linkage and aspects of the study design can influence our ability to correctly identify transmission links, in sometimes counterintuitive ways. We test the overall approach using outbreak simulations and provide guidance for calculating the sensitivity and specificity of the linkage criteria, the key inputs to our approach. The framework is freely available as the R package phylosamp, and is broadly applicable to designing and evaluating a wide array of pathogen phylogenetic studies.