Genome Biology (Jan 2024)

Improved prediction of bacterial CRISPRi guide efficiency from depletion screens through mixed-effect machine learning and data integration

  • Yanying Yu,
  • Sandra Gawlitt,
  • Lisa Barros de Andrade e Sousa,
  • Erinc Merdivan,
  • Marie Piraud,
  • Chase L. Beisel,
  • Lars Barquist

DOI
https://doi.org/10.1186/s13059-023-03153-y
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 22

Abstract

Read online

Abstract CRISPR interference (CRISPRi) is the leading technique to silence gene expression in bacteria; however, design rules remain poorly defined. We develop a best-in-class prediction algorithm for guide silencing efficiency by systematically investigating factors influencing guide depletion in genome-wide essentiality screens, with the surprising discovery that gene-specific features substantially impact prediction. We develop a mixed-effect random forest regression model that provides better estimates of guide efficiency. We further apply methods from explainable AI to extract interpretable design rules from the model. This study provides a blueprint for predictive models for CRISPR technologies where only indirect measurements of guide activity are available.