Genome Biology (Aug 2018)

Optical and physical mapping with local finishing enables megabase-scale resolution of agronomically important regions in the wheat genome

  • Gabriel Keeble-Gagnère,
  • Philippe Rigault,
  • Josquin Tibbits,
  • Raj Pasam,
  • Matthew Hayden,
  • Kerrie Forrest,
  • Zeev Frenkel,
  • Abraham Korol,
  • B. Emma Huang,
  • Colin Cavanagh,
  • Jen Taylor,
  • Michael Abrouk,
  • Andrew Sharpe,
  • David Konkin,
  • Pierre Sourdille,
  • Benoît Darrier,
  • Frédéric Choulet,
  • Aurélien Bernard,
  • Simone Rochfort,
  • Adam Dimech,
  • Nathan Watson-Haigh,
  • Ute Baumann,
  • Paul Eckermann,
  • Delphine Fleury,
  • Angela Juhasz,
  • Sébastien Boisvert,
  • Marc-Alexandre Nolin,
  • Jaroslav Doležel,
  • Hana Šimková,
  • Helena Toegelová,
  • Jan Šafář,
  • Ming-Cheng Luo,
  • Francisco Câmara,
  • Matthias Pfeifer,
  • Don Isdale,
  • Johan Nyström-Persson,
  • IWGSC,
  • Dal-Hoe Koo,
  • Matthew Tinning,
  • Dangqun Cui,
  • Zhengang Ru,
  • Rudi Appels

DOI
https://doi.org/10.1186/s13059-018-1475-4
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Background Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome. Results Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region. Conclusions Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.

Keywords