BMC Bioinformatics (Apr 2020)

2SigFinder: the combined use of small-scale and large-scale statistical testing for genomic island detection from a single genome

  • Rui Kong,
  • Xinnan Xu,
  • Xiaoqing Liu,
  • Pingan He,
  • Michael Q. Zhang,
  • Qi Dai

DOI
https://doi.org/10.1186/s12859-020-3501-2
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background Genomic islands are associated with microbial adaptations, carrying genomic signatures different from the host. Some methods perform an overall test to identify genomic islands based on their local features. However, regions of different scales will display different genomic features. Results We proposed here a novel method “2SigFinder “, the first combined use of small-scale and large-scale statistical testing for genomic island detection. The proposed method was tested by genomic island boundary detection and identification of genomic islands or functional features of real biological data. We also compared the proposed method with the comparative genomics and composition-based approaches. The results indicate that the proposed 2SigFinder is more efficient in identifying genomic islands. Conclusions From real biological data, 2SigFinder identified genomic islands from a single genome and reported robust results across different experiments, without annotated information of genomes or prior knowledge from other datasets. 2SigHunter identified 25 Pathogenicity, 1 tRNA, 2 Virulence and 2 Repeats from 27 Pathogenicity, 1 tRNA, 2 Virulence and 2 Repeats, and detected 101 Phage and 28 HEG out of 130 Phage and 36 HEGs in S. enterica Typhi CT18, which shows that it is more efficient in detecting functional features associated with GIs.

Keywords