International Journal of Molecular Sciences (Jul 2021)

Search for Highly Divergent Tandem Repeats in Amino Acid Sequences

  • Valentina Rudenko,
  • Eugene Korotkov

DOI
https://doi.org/10.3390/ijms22137096
Journal volume & issue
Vol. 22, no. 13
p. 7096

Abstract

Read online

We report a Method to Search for Highly Divergent Tandem Repeats (MSHDTR) in protein sequences which considers pairwise correlations between adjacent residues. MSHDTR was compared with some previously developed methods for searching for tandem repeats (TRs) in amino acid sequences, such as T-REKS and XSTREAM, which focus on the identification of TRs with significant sequence similarity, whereas MSHDTR detects repeats that significantly diverged during evolution, accumulating deletions, insertions, and substitutions. The application of MSHDTR to a search of the Swiss-Prot databank revealed over 15 thousand TR-containing amino acid sequences that were difficult to find using the other methods. Among the detected TRs, the most representative were those with consensus lengths of two and seven residues; these TRs were subjected to cluster analysis and the classes of patterns were identified. All TRs detected in this study have been combined into a databank accessible over the WWW.

Keywords