Egyptian Journal of Medical Human Genetics (Jan 2023)

In silico identification of prospective virulence factors associated with candidiasis in Meyerozyma guilliermondii strain SO from genome dataset

  • Robiatul Azilah Zainudin,
  • Suriana Sabri,
  • Abu Bakar Salleh,
  • Arpah Abu,
  • Raja Farhana Raja Khairuddin,
  • Siti Nurbaya Oslan

DOI
https://doi.org/10.1186/s43042-023-00384-3
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background Meyerozyma guilliermondii is a prospective yeast that has extensively contributed to the biotechnology sector. In 2015, M. guilliermondii strain SO which was isolated from spoiled orange has successfully been developed as an inducer-free expression system and attained a significant impact in producing industrially important recombinant proteins. The species possesses high similarity to Candida albicans which may cause candidiasis. The industrial-benefiting M. guilliermondii strain SO has been underexplored for its virulence status. Thus, this study aimed to document the potential virulence factors through the comprehensive in silico analysis of M. guilliermondii strain SO genome. This analysis demonstrated the molecular characterization which could distinguish the pathogenicity status of M. guilliermondii. Results The genome data were generated from Illumina HiSeq 4000 sequencing platform and assembled into 51 scaffolds successfully accumulating a genome size of 10.63 Mbp. These enclosed 5,335 CDS genes and 5,349 protein sequences with 43.72% GC content. About 99.29% of them were annotated to public databases. Komagataella phaffii, Saccharomyces cerevisiae and the reference strain of M. guilliermondii (ATCC 6260) were used as the controls. They were compared with our in-house strain SO to identify the consensus domain or subdomain which could putatively be considered as virulence factors. Candida albicans was used as the pathogenic model. Hence, hidden Markov model against strain SO proteome had identified secreted aspartic proteases (SAP), phospholipase C (PLC) and phospholipase D (PLD) with an E-value of 2.4e−107, 9.5e−200 and 0.0e+00, respectively, in resemblance of C. albicans. The topology of the phylogenetic analysis indicated that these virulence factors in M. guilliermondii strain SO and C. albicans branched from the same node and clustered together as a clade, signifying their molecular relatedness and congeneric among these species, subsequently proposing the virulence status of M. guilliermondii. Conclusion The SAP, PLC and PLD genes’ features that were significant in expressing determinants of pathogenicity were successfully identified in M. guilliermondii strain SO genome dataset, thus concluding the virulency of this species. On account of this finding, the strategy of gene knockout through CRISPR-Cas9 or homologous recombination strategies is needed to engineer the feasible novel expression host system. Over and above, the genetically modified strain of M. guilliermondii allegedly may eradicate the risk of candidiasis infection.

Keywords