Genome Biology (Aug 2023)
Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture
- Jennifer R. S. Meadows,
- Jeffrey M. Kidd,
- Guo-Dong Wang,
- Heidi G. Parker,
- Peter Z. Schall,
- Matteo Bianchi,
- Matthew J. Christmas,
- Katia Bougiouri,
- Reuben M. Buckley,
- Christophe Hitte,
- Anthony K. Nguyen,
- Chao Wang,
- Vidhya Jagannathan,
- Julia E. Niskanen,
- Laurent A. F. Frantz,
- Meharji Arumilli,
- Sruthi Hundi,
- Kerstin Lindblad-Toh,
- Catarina Ginja,
- Kadek Karang Agustina,
- Catherine André,
- Adam R. Boyko,
- Brian W. Davis,
- Michaela Drögemüller,
- Xin-Yao Feng,
- Konstantinos Gkagkavouzis,
- Giorgos Iliopoulos,
- Alexander C. Harris,
- Marjo K. Hytönen,
- Daniela C. Kalthoff,
- Yan-Hu Liu,
- Petros Lymberakis,
- Nikolaos Poulakakis,
- Ana Elisabete Pires,
- Fernando Racimo,
- Fabian Ramos-Almodovar,
- Peter Savolainen,
- Semina Venetsani,
- Imke Tammen,
- Alexandros Triantafyllidis,
- Bridgett vonHoldt,
- Robert K. Wayne,
- Greger Larson,
- Frank W. Nicholas,
- Hannes Lohi,
- Tosso Leeb,
- Ya-Ping Zhang,
- Elaine A. Ostrander
Affiliations
- Jennifer R. S. Meadows
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University
- Jeffrey M. Kidd
- Department of Human Genetics, University of Michigan
- Guo-Dong Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences
- Heidi G. Parker
- National Human Genome Research Institute, National Institutes of Health
- Peter Z. Schall
- Department of Human Genetics, University of Michigan
- Matteo Bianchi
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University
- Matthew J. Christmas
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University
- Katia Bougiouri
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen
- Reuben M. Buckley
- National Human Genome Research Institute, National Institutes of Health
- Christophe Hitte
- University of Rennes, CNRS, Institute Genetics and Development Rennes - UMR6290
- Anthony K. Nguyen
- Department of Human Genetics, University of Michigan
- Chao Wang
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University
- Vidhya Jagannathan
- Institute of Genetics, Vetsuisse Faculty, University of Bern
- Julia E. Niskanen
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center
- Laurent A. F. Frantz
- School of Biological and Behavioural Sciences, Queen Mary University of London, London E14NS, UK and Palaeogenomics Group, Department of Veterinary Sciences, Ludwig Maximilian University
- Meharji Arumilli
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center
- Sruthi Hundi
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center
- Kerstin Lindblad-Toh
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University
- Catarina Ginja
- BIOPOLIS-CIBIO-InBIO-Centro de Investigação Em Biodiversidade E Recursos Genéticos - ArchGen Group, Universidade Do Porto
- Kadek Karang Agustina
- Department of Public Health, Udayana University
- Catherine André
- University of Rennes, CNRS, Institute Genetics and Development Rennes - UMR6290
- Adam R. Boyko
- Department of Biomedical Sciences, Cornell University
- Brian W. Davis
- Department of Veterinary Integrative Biosciences, School of Veterinary Medicine and Biomedical Sciences, Texas A&M University
- Michaela Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern
- Xin-Yao Feng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences
- Konstantinos Gkagkavouzis
- Department of Genetics, School of Biology, ), Aristotle University of Thessaloniki, Thessaloniki, Macedonia 54124, Greece and Genomics and Epigenomics Translational Research (GENeTres), Center for Interdisciplinary Research and Innovation (CIRI-AUTH
- Giorgos Iliopoulos
- NGO “Callisto”, Wildlife and Nature Conservation Society
- Alexander C. Harris
- National Human Genome Research Institute, National Institutes of Health
- Marjo K. Hytönen
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center
- Daniela C. Kalthoff
- NGO “Callisto”, Wildlife and Nature Conservation Society
- Yan-Hu Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences
- Petros Lymberakis
- Natural History Museum of Crete & Department of Biology, University of Crete
- Nikolaos Poulakakis
- Natural History Museum of Crete & Department of Biology, University of Crete
- Ana Elisabete Pires
- BIOPOLIS-CIBIO-InBIO-Centro de Investigação Em Biodiversidade E Recursos Genéticos - ArchGen Group, Universidade Do Porto
- Fernando Racimo
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen
- Fabian Ramos-Almodovar
- Department of Human Genetics, University of Michigan
- Peter Savolainen
- Department of Gene Technology, Science for Life Laboratory, KTH - Royal Institute of Technology
- Semina Venetsani
- Department of Genetics, School of Biology, Aristotle University of Thessaloniki
- Imke Tammen
- Sydney School of Veterinary Science, The University of Sydney
- Alexandros Triantafyllidis
- Department of Genetics, School of Biology, ), Aristotle University of Thessaloniki, Thessaloniki, Macedonia 54124, Greece and Genomics and Epigenomics Translational Research (GENeTres), Center for Interdisciplinary Research and Innovation (CIRI-AUTH
- Bridgett vonHoldt
- Department of Ecology and Evolutionary Biology, Princeton University
- Robert K. Wayne
- Department of Ecology and Evolutionary Biology, Ecology and Evolutionary Biology, University of California
- Greger Larson
- Palaeogenomics and Bio-Archaeology Research Network, School of Archaeology, University of Oxford
- Frank W. Nicholas
- Sydney School of Veterinary Science, The University of Sydney
- Hannes Lohi
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center
- Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern
- Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences
- Elaine A. Ostrander
- National Human Genome Research Institute, National Institutes of Health
- DOI
- https://doi.org/10.1186/s13059-023-03023-7
- Journal volume & issue
-
Vol. 24,
no. 1
pp. 1 – 41
Abstract
Abstract Background The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function. Results We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection. Conclusions We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.
Keywords