Scientific Data (Jul 2024)

Genomic Reference Resource for African Cattle: Genome Sequences and High-Density Array Variants

  • Abdulfatai Tijjani,
  • Sumaya Kambal,
  • Endashaw Terefe,
  • Regina Njeru,
  • Moses Ogugo,
  • Gideon Ndambuki,
  • Ayao Missohou,
  • Amadou Traore,
  • Bashir Salim,
  • Chukwunonso Ezeasor,
  • Claire D’andre H.,
  • Emmanuel T. Obishakin,
  • Boubacar Diallo,
  • Essodina Talaki,
  • Issaka Y. Abdoukarim,
  • Oyekanmi Nash,
  • Richard Osei-Amponsah,
  • Simeone Ravaorimanana,
  • Youssouf Issa,
  • Tsadkan Zegeye,
  • Christopher Mukasa,
  • Christian Tiambo,
  • James G. D. Prendergast,
  • Stephen J. Kemp,
  • Jianlin Han,
  • Karen Marshall,
  • Olivier Hanotte

DOI
https://doi.org/10.1038/s41597-024-03589-2
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 10

Abstract

Read online

Abstract The diversity in genome resources is fundamental to designing genomic strategies for local breed improvement and utilisation. These resources also support gene discovery and enhance our understanding of the mechanisms of resilience with applications beyond local breeds. Here, we report the genome sequences of 555 cattle (208 of which comprise new data) and high-density (HD) array genotyping of 1,082 samples (537 new samples) from indigenous African cattle populations. The new sequences have an average genome coverage of ~30X, three times higher than the average (~10X) of the over 300 sequences already in the public domain. Following variant quality checks, we identified approximately 32.3 million sequence variants and 661,943 HD autosomal variants mapped to the Bos taurus reference genome (ARS-UCD1.2). The new datasets were generated as part of the Centre for Tropical Livestock Genetics and Health (CTLGH) Genomic Reference Resource for African Cattle (GRRFAC) initiative, which aspires to facilitate the generation of this livestock resource and hopes for its utilisation for complete indigenous breed characterisation and sustainable global livestock improvement.