BMC Genomics (Sep 2021)

Predicting metabolic pathway membership with deep neural networks by integrating sequential and ontology information

  • Imam Cartealy,
  • Li Liao

DOI
https://doi.org/10.1186/s12864-021-07629-8
Journal volume & issue
Vol. 22, no. S4
pp. 1 – 10

Abstract

Read online

Abstract Background Inference of protein’s membership in metabolic pathways has become an important task in functional annotation of protein. The membership information can provide valuable context to the basic functional annotation and also aid reconstruction of incomplete pathways. Previous works have shown success of inference by using various similarity measures of gene ontology. Results In this work, we set out to explore integrating ontology and sequential information to further improve the accuracy. Specifically, we developed a neural network model with an architecture tailored to facilitate the integration of features from different sources. Furthermore, we built models that are able to perform predictions from pathway-centric or protein-centric perspectives. We tested the classifiers using 5-fold cross validation for all metabolic pathways reported in KEGG database. Conclusions The testing results demonstrate that by integrating ontology and sequential information with a tailored architecture our deep neural network method outperforms the existing methods significantly in the pathway-centric mode, and in the protein-centric mode, our method either outperforms or performs comparably with a suite of existing GO term based semantic similarity methods.

Keywords