Scientific Data (Apr 2024)
A comprehensive dataset of protein-protein interactions and ligand binding pockets for advancing drug discovery
Abstract
Abstract This dataset represents a collection of pocket-centric structural data related to protein-protein interactions (PPIs) and PPI-related ligand binding sites. The dataset includes high-quality structural information on more than 23,000 pockets, 3,700 proteins on more than 500 organisms, and nearly 3500 ligands that can aid researchers in the fields of bioinformatics, structural biology, and drug discovery. It encompasses a diverse set of PPI complexes with more than 1,700 unique protein families including some with associated ligands, enabling detailed investigations into molecular interactions at the atomic level. This article introduces an indispensable resource designed to unlock the full potential of PPIs while pioneering a novel metric for pocket similarity for hypothesizing protein partners repurposing.