BMC Research Notes (Oct 2023)

PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis

  • Theerapong Krajaejun,
  • Preecha Patumcharoenpol,
  • Thidarat Rujirawat,
  • Weerayuth Kittichotirat,
  • Sithichoke Tangphatsornruang,
  • Tassanee Lohnoo,
  • Wanta Yingyong

DOI
https://doi.org/10.1186/s13104-023-06532-7
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 4

Abstract

Read online

Abstract Objectives Pythium insidiosum is the causative agent of pythiosis, a difficult-to-treat condition, in humans and animals worldwide. Biological information about this filamentous microorganism is sparse. Genomes of several P. insidiosum strains were sequenced using the Illumina short-read NGS platform, producing incomplete genome sequence data. PacBio long-read platform was employed to obtain a better-quality genome of Pythium insidiosum. The obtained genome data could promote basic research on the pathogen’s biology and pathogenicity. Data description gDNA sample was extracted from the P. insidiosum strain Pi-S for whole-genome sequencing by PacBio long-read NGS platform. Raw reads were assembled using CANU (v2.1), polished using ARROW (SMRT link version 5.0.1), aligned with the original raw PacBio reads using pbmm2 (v1.2.1), consensus sequence checked using ARROW, and gene predicted using Funannotate pipeline (v1.7.4). The genome completion was assessed using BUSCO (v4.0.2). As a result, 840 contigs (maximum length: 1.3 Mb; N 50: 229.9 Kb; L 50: 70) were obtained. Sequence assembly showed a genome size of 66.7 Mb (178x coverage; 57.2% G-C content) that contained 20,375 ORFs. A BUSCO-based assessment revealed 85.5% genome completion. All assembled contig sequences have been deposited in the NCBI database under the accession numbers BBXB02000001 - BBXB02000840.

Keywords