PLoS Computational Biology (Nov 2023)

CGG toolkit: Software components for computational genomics.

  • Dimitrios Vasileiou,
  • Christos Karapiperis,
  • Ismini Baltsavia,
  • Anastasia Chasapi,
  • Dag Ahrén,
  • Paul J Janssen,
  • Ioannis Iliopoulos,
  • Vasilis J Promponas,
  • Anton J Enright,
  • Christos A Ouzounis

DOI
https://doi.org/10.1371/journal.pcbi.1011498
Journal volume & issue
Vol. 19, no. 11
p. e1011498

Abstract

Read online

Public-domain availability for bioinformatics software resources is a key requirement that ensures long-term permanence and methodological reproducibility for research and development across the life sciences. These issues are particularly critical for widely used, efficient, and well-proven methods, especially those developed in research settings that often face funding discontinuities. We re-launch a range of established software components for computational genomics, as legacy version 1.0.1, suitable for sequence matching, masking, searching, clustering and visualization for protein family discovery, annotation and functional characterization on a genome scale. These applications are made available online as open source and include MagicMatch, GeneCAST, support scripts for CoGenT-like sequence collections, GeneRAGE and DifFuse, supported by centrally administered bioinformatics infrastructure funding. The toolkit may also be conceived as a flexible genome comparison software pipeline that supports research in this domain. We illustrate basic use by examples and pictorial representations of the registered tools, which are further described with appropriate documentation files in the corresponding GitHub release.