PLoS ONE (Jan 2013)
Plant F-box protein evolution is determined by lineage-specific timing of major gene family expansion waves.
Abstract
F-box proteins (FBPs) represent one of the largest and fastest evolving gene/protein families in the plant kingdom. The FBP superfamily can be divided into several subfamilies characterized by different C-terminal protein-protein interaction domains that recruit targets for proteasomal degradation. Hence, a clear picture of their phylogeny and molecular evolution is of special interest for the general understanding of evolutionary histories of multi-domain and/or large protein families in plants. In an effort to further understand the molecular evolution of F-box family proteins, we asked whether the largest subfamily in Arabidopsis thaliana, which carries a C-terminal F-box associated domain (FBA proteins), shares evolutionary patterns and signatures of selection with other FBPs. To address this question, we applied phylogenetic and molecular evolution analyses in combination with the evaluation of transcriptional profiles. Based on the 2219 FBA proteins that we identified de novo in 34 completely sequenced plant genomes, we compared their evolutionary patterns to those of a previously analyzed large subfamily carrying C-terminal kelch repeats. We found that these two large FBP subfamilies generally tend to evolve by massive waves of duplication, followed by sequence conservation of the F-box domain and sequence diversification of the target-recruiting domain. We conclude that the earlier in evolutionary time a major wave of expansion occurred, the more pronounced these selection signatures are. As a consequence, when performing cross-species comparisons among FBP subfamilies, significant differences will be observed in the selective signatures of protein-protein interaction domains. Depending on the species, the investigated subfamilies comprise up to 45% of the complete superfamily, indicating that other subfamilies possibly follow similar modes of evolution.