BMC Bioinformatics (Sep 2022)

A reinforcement learning approach for protein–ligand binding pose prediction

  • Chenran Wang,
  • Yang Chen,
  • Yuan Zhang,
  • Keqiao Li,
  • Menghan Lin,
  • Feng Pan,
  • Wei Wu,
  • Jinfeng Zhang

DOI
https://doi.org/10.1186/s12859-022-04912-7
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Protein ligand docking is an indispensable tool for computational prediction of protein functions and screening drug candidates. Despite significant progress over the past two decades, it is still a challenging problem, characterized by the still limited understanding of the energetics between proteins and ligands, and the vast conformational space that has to be searched to find a satisfactory solution. In this project, we developed a novel reinforcement learning (RL) approach, the asynchronous advantage actor-critic model (A3C), to address the protein ligand docking problem. The overall framework consists of two models. During the search process, the agent takes an action selected by the actor model based on the current location. The critic model then evaluates this action and predict the distance between the current location and true binding site. Experimental results showed that in both single- and multi-atom cases, our model improves binding site prediction substantially compared to a naïve model. For the single-atom ligand, copper ion (Cu2+), the model predicted binding sites have a median root-mean-square-deviation (RMSD) of 2.39 Å to the true binding sites when starting from random starting locations. For the multi-atom ligand, sulfate ion (SO4 2−), the predicted binding sites have a median RMSD of 3.82 Å to the true binding sites. The ligand-specific models built in this study can be used in solvent mapping studies and the RL framework can be readily scaled up to larger and more diverse sets of ligands.

Keywords