Communications Biology (Jul 2024)

Identification and characterization of specific motifs in effector proteins of plant parasites using MOnSTER

  • Giulia Calia,
  • Paola Porracciolo,
  • Yongpan Chen,
  • Djampa Kozlowski,
  • Hannes Schuler,
  • Alessandro Cestaro,
  • Michaël Quentin,
  • Bruno Favery,
  • Etienne G. J. Danchin,
  • Silvia Bottini

DOI
https://doi.org/10.1038/s42003-024-06515-9
Journal volume & issue
Vol. 7, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Plant pathogens cause billions of dollars of crop loss every year and are a major threat to global food security. Identifying and characterizing pathogens effectors is crucial towards their improved control. Because of their poor sequence conservation, effector identification is challenging, and current methods generate too many candidates without indication for prioritizing experimental studies. In most phyla, effectors contain specific sequence motifs which influence their localization and targets in the plant. Therefore, there is an urgent need to develop bioinformatics tools tailored for pathogen effectors. To circumvent these limitations, we have developed MOnSTER a specific tool that identifies clusters of motifs of protein sequences (CLUMPs). MOnSTER can be fed with motifs identified by de novo tools or from databases such as Pfam and InterProScan. The advantage of MOnSTER is the reduction of motif redundancy by clustering them and associating a score. This score encompasses the physicochemical properties of AAs and the motif occurrences. We built up our method to identify discriminant CLUMPs in oomycetes effectors. Consequently, we applied MOnSTER on plant parasitic nematodes and identified six CLUMPs in about 60% of the known nematode candidate parasitism proteins. Furthermore, we found co-occurrences of CLUMPs with protein domains important for invasion and pathogenicity. The potentiality of this tool goes beyond the effector characterization and can be used to easily cluster motifs and calculate the CLUMP-score on any set of protein sequences.