Scientific Data (May 2024)

PacBio Hi-Fi genome assembly of Sipha maydis, a model for the study of multipartite mutualism in insects

  • François Renoz,
  • Nicolas Parisot,
  • Patrice Baa-Puyoulet,
  • Léo Gerlin,
  • Samir Fakhour,
  • Hubert Charles,
  • Thierry Hance,
  • Federica Calevro

DOI
https://doi.org/10.1038/s41597-024-03297-x
Journal volume & issue
Vol. 11, no. 1
pp. 1–9

Abstract

Dependence on multiple nutritional endosymbionts has evolved repeatedly in insects feeding on unbalanced diets. However, reference genomes for species hosting multi-symbiotic nutritional systems are lacking, even though they are essential for deciphering the processes governing cooperative life between insects and anatomically integrated symbionts. The cereal aphid Sipha maydis is a promising model for addressing these issues, as it has evolved a nutritional dependence on two bacterial endosymbionts that complement each other. In this study, we used PacBio high-fidelity (HiFi) long-read sequencing to generate a highly contiguous genome assembly of S. maydis with a length of 410 Mb, comprising 3,570 contigs with a contig N50 length of 187 kb and a BUSCO completeness of 95.5%. We identified 117 Mb of repetitive sequences, accounting for 29% of the genome assembly, and predicted 24,453 protein-coding genes, of which 2,541 were predicted enzymes included in an integrated metabolic network with the two aphid-associated endosymbionts. These resources provide valuable genetic and metabolic information for understanding the evolution and functioning of multi-symbiotic systems in insects.
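
For readers unfamiliar with the assembly statistics quoted above, the following minimal Python sketch illustrates how a contig N50 is conventionally defined and checks the reported repeat fraction. It is an illustration only, not the authors' pipeline: the function name `n50` and the toy contig lengths are hypothetical, and only the 117 Mb and 410 Mb figures come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    total = sum(lengths)
    running = 0
    # Walk through the contigs from longest to shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("no contigs given")


# Hypothetical toy assembly (not data from the paper): total length 300.
toy_contigs = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_contigs)  # 100 + 80 = 180 >= 150, so the N50 is 80

# Repeat fraction from the figures reported in the abstract.
repeat_mb = 117
assembly_mb = 410
repeat_fraction = repeat_mb / assembly_mb  # about 0.285, i.e. roughly 29%

print(f"toy N50: {toy_n50}")
print(f"repeat fraction: {repeat_fraction:.1%}")
```

Applied to the real contig lengths of an assembly, the same function would return the contig N50; the value of 187 kb reported here cannot be reproduced from the abstract alone, since the individual contig lengths are not given.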