LVPocket: integrated 3D global-local information to protein binding pockets prediction with transfer learning of protein structure classification

Ruifeng Zhou; Jing Fan; Sishu Li; Wenjie Zeng; Yilun Chen; Xiaoshan Zheng; Hongyang Chen; Jun Liao

doi:10.1186/s13321-024-00871-8

Journal of Cheminformatics (Jul 2024)

LVPocket: integrated 3D global-local information to protein binding pockets prediction with transfer learning of protein structure classification

Ruifeng Zhou,
Jing Fan,
Sishu Li,
Wenjie Zeng,
Yilun Chen,
Xiaoshan Zheng,
Hongyang Chen,
Jun Liao

Affiliations

Ruifeng Zhou: School of Science, China Pharmaceutical University
Jing Fan: School of Science, China Pharmaceutical University
Sishu Li: School of Science, China Pharmaceutical University
Wenjie Zeng: School of Science, China Pharmaceutical University
Yilun Chen: School of Science, China Pharmaceutical University
Xiaoshan Zheng: School of Science, China Pharmaceutical University
Hongyang Chen: Research Center for Graph Computing, Zhejiang Lab
Jun Liao: School of Science, China Pharmaceutical University

DOI: https://doi.org/10.1186/s13321-024-00871-8
Journal volume & issue: Vol. 16, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Previous deep learning methods for predicting protein binding pockets mainly employed 3D convolution, yet an abundance of convolution operations may lead the model to excessively prioritize local information, thus overlooking global information. Moreover, it is essential for us to account for the influence of diverse protein folding structural classes. Because proteins classified differently structurally exhibit varying biological functions, whereas those within the same structural class share similar functional attributes. Results We proposed LVPocket, a novel method that synergistically captures both local and global information of protein structure through the integration of Transformer encoders, which help the model achieve better performance in binding pockets prediction. And then we tailored prediction models for data of four distinct structural classes of proteins using the transfer learning. The four fine-tuned models were trained on the baseline LVPocket model which was trained on the sc-PDB dataset. LVPocket exhibits superior performance on three independent datasets compared to current state-of-the-art methods. Additionally, the fine-tuned model outperforms the baseline model in terms of performance. Scientific contribution We present a novel model structure for predicting protein binding pockets that provides a solution for relying on extensive convolutional computation while neglecting global information about protein structures. Furthermore, we tackle the impact of different protein folding structures on binding pocket prediction tasks through the application of transfer learning methods. Graphical Abstract

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords