Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins

Stefanie D. Hueber; Tancred Frickey

doi:10.3390/jdb4010008

Journal of Developmental Biology (Feb 2016)

Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins

Stefanie D. Hueber,
Tancred Frickey

Affiliations

Stefanie D. Hueber: Department of Biology, University of Konstanz, Konstanz 78464, Germany
Tancred Frickey: Department of Biology, University of Konstanz, Konstanz 78464, Germany

DOI: https://doi.org/10.3390/jdb4010008
Journal volume & issue: Vol. 4, no. 1
p. 8

Abstract

Read online

Phylogenetic methods are key to providing models for how a given protein family evolved. However, these methods run into difficulties when sequence divergence is either too low or too high. Here, we provide a case study of Hox and ParaHox proteins so that additional insights can be gained using a new computational approach to help solve old classification problems. For two (Gsx and Cdx) out of three ParaHox proteins the assignments differ between the currently most established view and four alternative scenarios. We use a non-phylogenetic, pairwise-sequence-similarity-based method to assess which of the previous predictions, if any, are best supported by the sequence-similarity relationships between Hox and ParaHox proteins. The overall sequence-similarities show Gsx to be most similar to Hox2–3, and Cdx to be most similar to Hox4–8. The results indicate that a purely pairwise-sequence-similarity-based approach can provide additional information not only when phylogenetic inference methods have insufficient information to provide reliable classifications (as was shown previously for central Hox proteins), but also when the sequence variation is so high that the resulting phylogenetic reconstructions are likely plagued by long-branch-attraction artifacts.

Published in Journal of Developmental Biology

ISSN: 2221-3759 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Biology (General)
Website: http://www.mdpi.com/journal/jdb

About the journal

Abstract

Keywords