G3: Genes, Genomes, Genetics (Jun 2018)

diploS/HIC: An Updated Approach to Classifying Selective Sweeps

  • Andrew D. Kern,
  • Daniel R. Schrider

DOI
https://doi.org/10.1534/g3.118.200262
Journal volume & issue
Vol. 8, no. 6
pp. 1959 – 1970

Abstract

Read online

Identifying selective sweeps in populations that have complex demographic histories remains a difficult problem in population genetics. We previously introduced a supervised machine learning approach, S/HIC, for finding both hard and soft selective sweeps in genomes on the basis of patterns of genetic variation surrounding a window of the genome. While S/HIC was shown to be both powerful and precise, the utility of S/HIC was limited by the use of phased genomic data as input. In this report we describe a deep learning variant of our method, diploS/HIC, that uses unphased genotypes to accurately classify genomic windows. diploS/HIC is shown to be quite powerful even at moderate to small sample sizes.

Keywords