Data in Brief (Feb 2021)
Illumina short-read sequencing data, de novo assembly and annotations of the Drosophila nasuta nasuta genome
Abstract
The Drosophila nasuta nasuta (D. n. nasuta) is a member of nasuta subgroup of immigrans species group of Drosophila widely distributed across South-East Asia and central to Southern Africa. It displays morphological similarities with other members of the nasuta subgroup with which it has a recent divergence history. The genomic DNA of D. n. nasuta Coorg strain was paired-end sequenced using Illumina HiSeq 2500 technology to obtain a draft genome assembly of 145.64 Mb. The generated assembly retrieved 93.6% of the conserved dipteran BUSCO orthologs. Approximately 85% of the ab initio predicted proteins exhibit sequence similarity to the proteins of D. albomicans which is the closest annotated species. This draft genome sequence is a valuable resource to Drosophila geneticists and evolutionary biologists to understand molecular organisation of the genome and its evolution during early stages of speciation.