BMC Genomics (Sep 2024)

Optimizing next-generation sequencing efficiency in clinical settings: analysis of read length impact on cost and performance

  • Pedro Milet Meirelles,
  • Pablo Alessandro B Viana,
  • Diogo Antonio Tschoeke,
  • Laise de Moraes,
  • Luciane Amorim Santos,
  • Manoel Barral-Netto,
  • Ricardo Khouri,
  • Pablo Ivan P Ramos

DOI
https://doi.org/10.1186/s12864-024-10778-1
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Background The expansion of sequencing technologies as a result of the response to the COVID-19 pandemic enabled pathogen (meta)genomics to be deployed as a routine component of surveillance in many countries. Scaling genomic surveillance, however, comes with associated costs in both equipment and sequencing reagents, which should be optimized. Here, we evaluate the cost efficiency and performance of different read lengths in identifying pathogens in metagenomic samples. We carefully evaluated performance metrics, costs, and time requirements relative to choices of 75, 150 and 300 base pairs (bp) read lengths in pathogen identification. Results Our findings revealed that moving from 75 bp to 150 bp read length approximately doubles both the cost and sequencing time. Opting for 300 bp reads leads to approximately two- and three-fold increases, respectively, in cost and sequencing time compared to 75 bp reads. For viral pathogen detection, the sensitivity median ranged from 99% with 75 bp reads to 100% with 150–300 bp reads. However, bacterial pathogens detection was less effective with shorter reads: 87% with 75 bp, 95% with 150 bp, and 97% with 300 bp reads. These findings were consistent across different levels of taxa abundance. The precision of pathogen detection using shorter reads was comparable to that of longer reads across most viral and bacterial taxa. Conclusions During disease outbreak situations, when swift responses are required for pathogen identification, we suggest prioritizing 75 bp read lengths, especially if detection of viral pathogens is aimed. This practical approach allows better use of resources, enabling the sequencing of more samples using streamlined workflows, while maintaining a reliable response capability.

Keywords