Computational and Structural Biotechnology Journal (Jan 2023)

Prediction of CRISPR-Cas9 off-target activities with mismatches and indels based on hybrid neural network

  • Yanpeng Yang,
  • Jian Li,
  • Quan Zou,
  • Yaoping Ruan,
  • Hailin Feng

Journal volume & issue
Vol. 21
pp. 5039 – 5048

Abstract

Read online

The CRISPR/Cas9 system has significantly advanced the field of gene editing, yet its clinical application is constrained by the considerable challenge of off-target effects. Although numerous deep learning models for off-target prediction have been proposed, most struggle to effectively extract the nuanced features of guide RNA (gRNA) and DNA sequence pairs and to mitigate information loss during data transmission within the model. To address these limitations, we introduce a novel Hybrid Neural Network (HNN) model that employs a parallelized network structure to fully extract pertinent features from different positions and types of bases in the sequence to minimize information loss. Notably, this study marks the first application of word embedding techniques to extract information from sequence pairs that contain insertions and deletions (Indels). Comprehensive evaluation across diverse datasets indicates that our proposed model outperforms existing state-of-the-art prediction methods in off-target prediction. The datasets and source codes supporting this study can be found at https://github.com/Yang-k955/CRISPR-HW.

Keywords