PLoS ONE (Jan 2021)

Efficient association mapping from k-mers-An application in finding sex-specific sequences.

  • Zakaria Mehrab,
  • Jaiaid Mobin,
  • Ibrahim Asadullah Tahmid,
  • Atif Rahman

DOI
https://doi.org/10.1371/journal.pone.0245058
Journal volume & issue
Vol. 16, no. 1
p. e0245058

Abstract

Read online

Genome wide association studies (GWAS) attempt to map genotypes to phenotypes in organisms. This is typically performed by genotyping individuals using microarray or by aligning whole genome sequencing reads to a reference genome. Both approaches require knowledge of a reference genome which hinders their application to organisms with no or incomplete reference genomes. This caveat can be removed by using alignment-free association mapping methods based on k-mers from sequencing reads. Here we present an improved implementation of an alignment free association mapping method. The new implementation is faster and includes additional features to make it more flexible than the original implementation. We have tested our implementation on an E. Coli ampicillin resistance dataset and observe improvement in execution time over the original implementation while maintaining accuracy in results. We also demonstrate that the method can be applied to find sex specific sequences.