BMC Bioinformatics (Jan 2006)

A reinforced merging methodology for mapping unique peptide motifs in members of protein families

  • Chang Chun-Tien,
  • Tang Chuan-Yi,
  • Wu Pei-Chih,
  • Su Bo-Han,
  • Fan Tan-chi,
  • Pai Tun-Wen,
  • Chang Hao-Teng,
  • Liu Shi-Hwei,
  • Chang Margaret

DOI
https://doi.org/10.1186/1471-2105-7-38
Journal volume & issue
Vol. 7, no. 1
p. 38

Abstract

Read online

Abstract Background Members of a protein family often have highly conserved sequences; most of these sequences carry identical biological functions and possess similar three-dimensional (3-D) structures. However, enzymes with high sequence identity may acquire differential functions other than the common catalytic ability. It is probable that each of their variable regions consists of a unique peptide motif (UPM), which selectively interacts with other cellular proteins, rendering additional biological activities. The ability to identify and localize such UPMs is paramount in recognizing the characteristic role of each member of a protein family. Results We have developed a reinforced merging algorithm (RMA) with which non-gapped UPMs were identified in a variety of query protein sequences including members of human ribonuclease A (RNaseA), epidermal growth factor receptor (EGFR), matrix metalloproteinase (MMP), and Sma-and-Mad related protein families (Smad). The UPMs generally occupy specific positions in the resolved 3-D structures, especially the loop regions on the structural surfaces. These motifs coincide with the recognition sites for antibodies, as the epitopes of four monoclonal antibodies and two polyclonal antibodies were shown to overlap with the UPMs. Most of the UPMs were found to correlate well with the potential antigenic regions predicted by PROTEAN. Furthermore, an accuracy of 70% can be achieved in terms of mapping a UPM to an epitope. Conclusion Our study provides a bioinformatic approach for searching and predicting potential epitopes and interacting motifs that distinguish different members of a protein family.