Scientific Data (Oct 2024)

Novel Megaptera novaeangliae (Humpback whale) haplotype chromosome-level reference genome

  • Maria-Vittoria Carminati,
  • Vlonjat Lonnie Gashi,
  • Ruiqi Li,
  • Daniel Jacob Klee,
  • Sara Rose Padula,
  • Ajay Manish Patel,
  • Andy Dick Yee Tan,
  • Jacqueline Mattos,
  • Nolan Kane

DOI
https://doi.org/10.1038/s41597-024-03922-9
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 7

Abstract

A kidney sample (KW2013002) from a stranded Megaptera novaeangliae (Humpback whale) calf was sequenced to produce the first chromosome-level reference genome for this species [1]. The calf, a 457 cm, 2,500 lb male, was found stranded in Hawai’i Kai, HI, in 2013 and was marked as abandoned/orphaned. In 2023, 1 g of kidney tissue was sequenced using PacBio long-read DNA sequencing, chromatin conformation capture (Hi-C), RNA sequencing, and mitochondrial sequencing to comprehensively characterize the genome and transcriptome of M. novaeangliae. Data validation includes a synteny analysis, mitochondrial annotation, and a comparison of BUSCO scores (scaffold-level assembly vs. reference genome, and Balaenoptera musculus (Blue whale) vs. M. novaeangliae). BUSCO analysis was also performed on an M. novaeangliae scaffold-level assembly to gauge the genomic completeness of the reference genome: the scaffold-level assembly scored 91.2%, versus 95.4% for the reference genome. Synteny analysis against the B. musculus genome was used to assess chromosome-level coverage and structure. Further, a time-based phylogenetic tree was constructed using the sequenced data and publicly available genomes.