BMC Bioinformatics (Sep 2023)

Identification of plant vacuole proteins by using graph neural network and contact maps

  • Jianan Sui,
  • Jiazi Chen,
  • Yuehui Chen,
  • Naoki Iwamori,
  • Jin Sun

DOI
https://doi.org/10.1186/s12859-023-05475-x
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Plant vacuoles are essential organelles in the growth and development of plants, and accurate identification of their proteins is crucial for understanding their biological properties. In this study, we developed a novel model called GraphIdn for the identification of plant vacuole proteins. The model uses SeqVec, a deep representation learning model, to initialize the amino acid sequence. We utilized the AlphaFold2 algorithm to obtain the structural information of corresponding plant vacuole proteins, and then fed the calculated contact maps into a graph convolutional neural network. GraphIdn achieved accuracy values of 88.51% and 89.93% in independent testing and fivefold cross-validation, respectively, outperforming previous state-of-the-art predictors. As far as we know, this is the first model to use predicted protein topology structure graphs to identify plant vacuole proteins. Furthermore, we assessed the effectiveness and generalization capability of our GraphIdn model by applying it to identify and locate peroxisomal proteins, which yielded promising outcomes. The source code and datasets can be accessed at https://github.com/SJNNNN/GraphIdn .

Keywords