BMC Bioinformatics (Jul 2017)

TnseqDiff: identification of conditionally essential genes in transposon sequencing studies

  • Lili Zhao,
  • Mark T. Anderson,
  • Weisheng Wu,
  • Harry L. T. Mobley,
  • Michael A. Bachman

DOI
https://doi.org/10.1186/s12859-017-1745-2
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Tn-Seq is a high throughput technique for analysis of transposon mutant libraries to determine conditional essentiality of a gene under an experimental condition. A special feature of the Tn-seq data is that multiple mutants in a gene provides independent evidence to prioritize that gene as being essential. The existing methods do not account for this feature or rely on a high-density transposon library. Moreover, these methods are unable to accommodate complex designs. Results The method proposed here is specifically designed for the analysis of Tn-Seq data. It utilizes two steps to estimate the conditional essentiality for each gene in the genome. First, it collects evidence of conditional essentiality for each insertion by comparing read counts of that insertion between conditions. Second, it combines insertion-level evidence for the corresponding gene. It deals with data from both low- and high-density transposon libraries and accommodates complex designs. Moreover, it is very fast to implement. The performance of the proposed method was tested on simulated data and experimental Tn-Seq data from Serratia marcescens transposon mutant library used to identify genes that contribute to fitness in a murine model of infection. Conclusion We describe a new, efficient method for identifying conditionally essential genes in Tn-Seq experiments with high detection sensitivity and specificity. It is implemented as TnseqDiff function in R package Tnseq and can be installed from the Comprehensive R Archive Network, CRAN.

Keywords