F1000Research (May 2021)

First draft genome assembly of the desert locust, Schistocerca gregaria [version 2; peer review: 2 approved, 1 approved with reservations]

  • Heleen Verlinden,
  • Lieven Sterck,
  • Jia Li,
  • Zhen Li,
  • Anna Yssel,
  • Yannick Gansemans,
  • Rik Verdonck,
  • Michiel Holtof,
  • Hojun Song,
  • Spencer T. Behmer,
  • Gregory A. Sword,
  • Tom Matheson,
  • Swidbert R. Ott,
  • Dieter Deforce,
  • Filip Van Nieuwerburgh,
  • Yves Van de Peer,
  • Jozef Vanden Broeck

DOI
https://doi.org/10.12688/f1000research.25148.2
Journal volume & issue
Vol. 9

Abstract

Read online

Background: At the time of publication, the most devastating desert locust crisis in decades is affecting East Africa, the Arabian Peninsula and South-West Asia. The situation is extremely alarming in East Africa, where Kenya, Ethiopia and Somalia face an unprecedented threat to food security and livelihoods. Most of the time, however, locusts do not occur in swarms, but live as relatively harmless solitary insects. The phenotypically distinct solitarious and gregarious locust phases differ markedly in many aspects of behaviour, physiology and morphology, making them an excellent model to study how environmental factors shape behaviour and development. A better understanding of the extreme phenotypic plasticity in desert locusts will offer new, more environmentally sustainable ways of fighting devastating swarms. Methods: High molecular weight DNA derived from two adult males was used for Mate Pair and Paired End Illumina sequencing and PacBio sequencing. A reliable reference genome of Schistocerca gregaria was assembled using the ABySS pipeline, scaffolding was improved using LINKS. Results: In total, 1,316 Gb Illumina reads and 112 Gb PacBio reads were produced and assembled. The resulting draft genome consists of 8,817,834,205 bp organised in 955,015 scaffolds with an N50 of 157,705 bp, making the desert locust genome the largest insect genome sequenced and assembled to date. In total, 18,815 protein-encoding genes are predicted in the desert locust genome, of which 13,646 (72.53%) obtained at least one functional assignment based on similarity to known proteins. Conclusions: The desert locust genome data will contribute greatly to studies of phenotypic plasticity, physiology, neurobiology, molecular ecology, evolutionary genetics and comparative genomics, and will promote the desert locust’s use as a model system. The data will also facilitate the development of novel, more sustainable strategies for preventing or combating swarms of these infamous insects.