Predicting horizontal gene transfers with perfect transfer networks

Alitzel López Sánchez; Manuel Lafond

doi:10.1186/s13015-023-00242-2

Algorithms for Molecular Biology (Feb 2024)

Affiliations

Alitzel López Sánchez: Department of Computer Science, Université de Sherbrooke
Manuel Lafond: Department of Computer Science, Université de Sherbrooke

DOI: https://doi.org/10.1186/s13015-023-00242-2
Journal volume & issue: Vol. 19, no. 1
pp. 1 – 19

Abstract

Background

Horizontal gene transfer inference approaches are usually based on gene sequences: parametric methods search for patterns that deviate from a particular genomic signature, while phylogenetic methods use sequences to reconstruct the gene and species trees. However, it is well known that sequence-based methods have difficulty identifying ancient transfers, since mutations have had enough time to erase all evidence of such events. In this work, we ask whether character-based methods can predict gene transfers. Their advantage over sequences is that homologous genes can have low DNA similarity, yet still retain enough important common motifs to share character traits, for instance the same functional or expression profile. A phylogeny that has two separate clades that acquired the same character independently might indicate the presence of a transfer even in the absence of sequence similarity.

Our contributions

We introduce perfect transfer networks, which are phylogenetic networks that can explain the character diversity of a set of taxa under the assumption that characters have unique births, and that once a character is gained it is rarely lost. Examples of such traits include transposable elements, biochemical markers and the emergence of organelles, to name a few. We study the differences between our model and two similar models: perfect phylogenetic networks and ancestral recombination networks. Our goal is to initiate a study of the structural and algorithmic properties of perfect transfer networks. We then show that one can decide in polynomial time whether a given network is a valid explanation for a set of taxa, and show how, for a given tree, one can add transfer edges to it so that it explains a set of taxa. We finally provide lower and upper bounds on the number of transfers required to explain a set of taxa, in the worst case.
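
To make the unique-birth, no-loss assumption concrete, here is a minimal Python sketch. It is not the authors' algorithm; it is the classical pairwise compatibility test for binary characters, used here only as an illustration. It assumes that taxa are given as sets of characters (the function name and input format are hypothetical). If characters are born once and never lost, a tree without any transfer edge can explain the taxa only when, for every two characters, the sets of taxa carrying them are nested or disjoint. Pairs that violate this condition are the ones for which a transfer-free explanation is ruled out.

```python
from itertools import combinations


def conflicting_character_pairs(taxa):
    """Return the pairs of characters that cannot both sit on a tree
    without transfers, assuming unique births and no losses.

    taxa: dict mapping a taxon name to the set of characters it has
          (hypothetical input format chosen for this illustration).
    """
    # carriers[c] is the set of taxa that possess character c.
    carriers = {}
    for taxon, chars in taxa.items():
        for c in chars:
            carriers.setdefault(c, set()).add(taxon)

    conflicts = []
    for a, b in combinations(sorted(carriers), 2):
        set_a, set_b = carriers[a], carriers[b]
        overlap = set_a & set_b
        nested = set_a <= set_b or set_b <= set_a
        # Two carrier sets that overlap without being nested cannot
        # both be clades of the same tree.
        if overlap and not nested:
            conflicts.append((a, b))
    return conflicts


if __name__ == "__main__":
    compatible = {"t1": {"x", "y"}, "t2": {"x"}, "t3": set()}
    conflicting = {"t1": {"x", "y"}, "t2": {"x"}, "t3": {"y"}}
    print(conflicting_character_pairs(compatible))
    print(conflicting_character_pairs(conflicting))
```

An empty result means the characters are pairwise compatible. A non-empty result only signals that a tree alone is insufficient under these assumptions; it says nothing about where transfer edges should go or how many are needed.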

Published in Algorithms for Molecular Biology

ISSN: 1748-7188 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Genetics
Website: http://almob.biomedcentral.com
