Data in Brief (Oct 2021)

The dataset of predicted trypsin serine peptidases and their inactive homologs in Tenebrio molitor transcriptomes

  • Nikita I. Zhiganov,
  • Valeriia F. Tereshchenkova,
  • Brenda Oppert,
  • Irina Y. Filippova,
  • Nataliya V. Belyaeva,
  • Yakov E. Dunaevsky,
  • Mikhail A. Belozersky,
  • Elena N. Elpidina

Journal volume & issue
Vol. 38
p. 107301

Abstract

Read online

Tenebrio molitor is an important coleopteran model insect and agricultural pest from the Tenebrionidae family. We used RNA-Seq transcriptome data from T. molitor to annotate trypsin-like sequences from the chymotrypsin S1 family of serine peptidases, including sequences of active serine peptidases (SerP) and their inactive homologs (SerPH) in T. molitor transcriptomes. A total of 63 S1 family tryspin-like serine peptidase sequences were de novo assembled. Among the sequences, 58 were predicted to be active trypsins and five inactive SerPH. The length of preproenzyme and mature form of the predicted enzyme, position of signal peptide and proenzyme cleavage sites, molecular mass, active site and S1 substrate binding subsite residues, and transmembrane and regulatory domains were analyzed using bioinformatic tools. The data can be used for further physiological, biochemical, and phylogenetic study of tenebrionid pests and other animal systems.

Keywords