Genome Biology (Aug 2017)

A statistical framework for analyzing deep mutational scanning data

  • Alan F. Rubin,
  • Hannah Gelman,
  • Nathan Lucas,
  • Sandra M. Bajjalieh,
  • Anthony T. Papenfuss,
  • Terence P. Speed,
  • Douglas M. Fowler

DOI
https://doi.org/10.1186/s13059-017-1272-5
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicates. We apply our model to one novel and five published datasets comprising 243,732 variants and demonstrate its superiority in removing noisy variants and conducting hypothesis testing. Simulations show our model applies to scans based on cell growth or binding and handles common experimental errors. We implemented our model in Enrich2, software that can empower researchers analyzing deep mutational scanning data.