Journal of Cheminformatics (Jun 2024)

PUResNetV2.0: a deep learning model leveraging sparse representation for improved ligand binding site prediction

  • Kandel Jeevan,
  • Shrestha Palistha,
  • Hilal Tayara,
  • Kil T. Chong

DOI
https://doi.org/10.1186/s13321-024-00865-6
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Accurate ligand binding site prediction (LBSP) within proteins is essential for drug discovery. We developed ProteinUNetResNetV2.0 (PUResNetV2.0), leveraging sparse representation of protein structures to improve LBSP accuracy. Our training dataset included protein complexes from 4729 protein families. Evaluations on benchmark datasets showed that PUResNetV2.0 achieved an 85.4% Distance Center Atom (DCA) success rate and a 74.7% F1 Score on the Holo801 dataset, outperforming existing methods. However, its performance in specific cases, such as RNA, DNA, peptide-like ligand, and ion binding site prediction, was limited due to constraints in our training data. Our findings underscore the potential of sparse representation in LBSP, especially for oligomeric structures, suggesting PUResNetV2.0 as a promising tool for computational drug discovery.