Data in Brief (Aug 2024)

Dataset of PLA2 family identified from transcriptomic high-throughput sequencing of Androctonus crassicauda (Scorpionida: Buthidae) venom gland

  • Fatemeh Salabi,
  • Hedieh Jafari

Journal volume & issue
Vol. 55
p. 110629

Abstract

Read online

Recently, RNA sequencing has been widely applied to deeply understand the molecular diversity of the venom compounds of various venomous animal species, including scorpions. Among the venomous scorpion species of the Buthidae family, there are many documents of stinging and severe envenoming of victims by the scorpion of Androctonus crassicauda. We present here a high-throughput RNA sequencing dataset of the venom glands from five A. crassicauda individuals, including male and female scorpions. Furthermore, the assembled data corresponding to annotated PLA2 transcripts are also presented. The dataset in this report is related to our research article entitled: “Whole transcriptome sequencing reveals the activity of the PLA2 family members in Androctonus crassicauda (Scorpionida: Buthidae) venom gland” [1]. Here, the venom gland transcriptome analysis of the A. crassicauda was performed. The analysis of concatenated clustered transcriptome assembly using TrinityStats.pl showed that de novo assembly of 517,799,704 clean read pairs generated 744,804 trinity transcripts representing 563,526 trinity genes. BUSCO score for the concatenated clustered transcriptome assembly against orthologs from Arachnida showed 96.7 % complete, 1.6 % fragmented, 1.7 % missing genes, and 2934 genes. Subsequently, the sequences represented PLA2 annotation were extracted from the transcriptome dataset using BLAST searches against the local PLA2 database. We found several cDNA sequences representing PLA2 annotations, which based on sequence similarity to previously found PLA2s, we named platelet-activating factor acetylhydrolases, calcium-dependent PLA2s, calcium-independent PLA2s, and secreted PLA2s. The PLA2 data significantly enrich KEGG pathways related to lipid metabolism. This manuscript complements the primary research article by providing additional data on the abundant estimation of PLA2s.

Keywords