PLoS ONE (Jan 2023)

Predicting CRISPR-Cas12a guide efficiency for targeting using machine learning.

  • Aidan O'Brien,
  • Denis C Bauer,
  • Gaetan Burgio

DOI
https://doi.org/10.1371/journal.pone.0292924
Journal volume & issue
Vol. 18, no. 10
p. e0292924

Abstract

Read online

Genome editing through the development of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeat)-Cas technology has revolutionized many fields in biology. Beyond Cas9 nucleases, Cas12a (formerly Cpf1) has emerged as a promising alternative to Cas9 for editing AT-rich genomes. Despite the promises, guide RNA efficiency prediction through computational tools search still lacks accuracy. Through a computational meta-analysis, here we report that Cas12a target and off-target cleavage behavior are a factor of nucleotide bias combined with nucleotide mismatches relative to the protospacer adjacent motif (PAM) site. These features helped to train a Random Forest machine learning model to improve the accuracy by at least 15% over existing algorithms to predict guide RNA efficiency for the Cas12a enzyme. Despite the progresses, our report underscores the need for more representative datasets and further benchmarking to reliably and accurately predict guide RNA efficiency and off-target effects for Cas12a enzymes.