Densest subgraph-based methods for protein-protein interaction hot spot prediction

Ruiming Li; Jung-Yu Lee; Jinn-Moon Yang; Tatsuya Akutsu

doi:10.1186/s12859-022-04996-1

BMC Bioinformatics (Oct 2022)

Densest subgraph-based methods for protein-protein interaction hot spot prediction

Ruiming Li,
Jung-Yu Lee,
Jinn-Moon Yang,
Tatsuya Akutsu

Affiliations

Ruiming Li: Bioinformatics Center, Institute for Chemical Research, Kyoto University
Jung-Yu Lee: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University
Jinn-Moon Yang: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University
Tatsuya Akutsu: Bioinformatics Center, Institute for Chemical Research, Kyoto University

DOI: https://doi.org/10.1186/s12859-022-04996-1
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Hot spots play an important role in protein binding analysis. The residue interaction network is a key point in hot spot prediction, and several graph theory-based methods have been proposed to detect hot spots. Although the existing methods can yield some interesting residues by network analysis, low recall has limited their abilities in finding more potential hot spots. Result In this study, we develop three graph theory-based methods to predict hot spots from only a single residue interaction network. We detect the important residues by finding subgraphs with high densities, i.e., high average degrees. Generally, a high degree implies a high binding possibility between protein chains, and thus a subgraph with high density usually relates to binding sites that have a high rate of hot spots. By evaluating the results on 67 complexes from the SKEMPI database, our methods clearly outperform existing graph theory-based methods on recall and F-score. In particular, our main method, Min-SDS, has an average recall of over 0.665 and an f2-score of over 0.364, while the recall and f2-score of the existing methods are less than 0.400 and 0.224, respectively. Conclusion The Min-SDS method performs best among all tested methods on the hot spot prediction problem, and all three of our methods provide useful approaches for analyzing bionetworks. In addition, the densest subgraph-based methods predict hot spots with only one residue interaction network, which is constructed from spatial atomic coordinate data to mitigate the shortage of data from wet-lab experiments.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal

Abstract

Keywords