PeerJ (Jul 2023)

Protein family neighborhood analyzer—ProFaNA

  • Bartosz Baranowski,
  • Krzysztof Pawłowski

DOI
https://doi.org/10.7717/peerj.15715
Journal volume & issue
Vol. 11
p. e15715

Abstract

Read online Read online

Background Functionally related genes are well known to be often grouped in close vicinity in the genomes, particularly in prokaryotes. Notwithstanding the diverse evolutionary mechanisms leading to this phenomenon, it can be used to predict functions of uncharacterized genes. Methods Here, we provide a simple but robust statistical approach that leverages the vast amounts of genomic data available today. Considering a protein domain as a functional unit, one can explore other functional units (domains) that significantly often occur within the genomic neighborhoods of the queried domain. This analysis can be performed across different taxonomic levels. Provisions can also be made to correct for the uneven sampling of the taxonomic space by genomic sequencing projects that often focus on large numbers of very closely related strains, e.g., pathogenic ones. To this end, an optional procedure for averaging occurrences within subtaxa is available. Results Several examples show this approach can provide useful functional predictions for uncharacterized gene families, and how to combine this information with other approaches. The method is made available as a web server at http://bioinfo.sggw.edu.pl/neighborhood_analysis.

Keywords