BMC Genomics (Jan 2020)

Enhanced genome assembly and a new official gene set for Tribolium castaneum

  • Nicolae Herndon,
  • Jennifer Shelton,
  • Lizzy Gerischer,
  • Panos Ioannidis,
  • Maria Ninova,
  • Jürgen Dönitz,
  • Robert M. Waterhouse,
  • Chun Liang,
  • Carsten Damm,
  • Janna Siemanowski,
  • Peter Kitzmann,
  • Julia Ulrich,
  • Stefan Dippel,
  • Georg Oberhofer,
  • Yonggang Hu,
  • Jonas Schwirz,
  • Magdalena Schacht,
  • Sabrina Lehmann,
  • Alice Montino,
  • Nico Posnien,
  • Daniela Gurska,
  • Thorsten Horn,
  • Jan Seibert,
  • Iris M. Vargas Jentzsch,
  • Kristen A. Panfilio,
  • Jianwei Li,
  • Ernst A. Wimmer,
  • Dominik Stappert,
  • Siegfried Roth,
  • Reinhard Schröder,
  • Yoonseong Park,
  • Michael Schoppmeier,
  • Ho-Ryun Chung,
  • Martin Klingler,
  • Sebastian Kittelmann,
  • Markus Friedrich,
  • Rui Chen,
  • Boran Altincicek,
  • Andreas Vilcinskas,
  • Evgeny Zdobnov,
  • Sam Griffiths-Jones,
  • Matthew Ronshaugen,
  • Mario Stanke,
  • Sue J. Brown,
  • Gregor Bucher

DOI
https://doi.org/10.1186/s12864-019-6394-6
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 13

Abstract

Background: The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established, and resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high-quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, with only a small set of RNA sequence data, which limited annotation quality.

Results: Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753 kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads from different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well-defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to GenBank and accepted as a RefSeq genome by NCBI.

Conclusions: The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much-improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.
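For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the total assembly length. The function name `n50` and the example lengths are illustrative assumptions; they are not taken from the paper or from the Tcas5.2 assembly.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of scaffold (or contig) lengths.

    Hypothetical helper for illustration only; not the authors' pipeline.
    """
    ordered = sorted(lengths, reverse=True)  # longest scaffolds first
    if not ordered:
        raise ValueError("at least one scaffold length is required")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Stop at the first scaffold that brings the cumulative length
        # to at least half of the total assembly length.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty input")


# Toy example with made-up lengths (not real Tribolium data).
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
example_n50 = n50(example_lengths)
```

Joining scaffolds, as described in the Results, replaces several short entries with fewer long ones, which is why the statistic rises even when the total assembled length barely changes.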

Keywords