Genome Biology (Dec 2022)

STRling: a k-mer counting approach that detects short tandem repeat expansions at known and novel loci

  • Harriet Dashnow,
  • Brent S. Pedersen,
  • Laurel Hiatt,
  • Joe Brown,
  • Sarah J. Beecroft,
  • Gianina Ravenscroft,
  • Amy J. LaCroix,
  • Phillipa Lamont,
  • Richard H. Roxburgh,
  • Miriam J. Rodrigues,
  • Mark Davis,
  • Heather C. Mefford,
  • Nigel G. Laing,
  • Aaron R. Quinlan

DOI
https://doi.org/10.1186/s13059-022-02826-4
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 20

Abstract

Expansions of short tandem repeats (STRs) cause many rare diseases. Expansion detection is challenging with short-read DNA sequencing data since supporting reads are often mapped incorrectly. Detection is particularly difficult for “novel” STRs, which include new motifs at known loci or STRs absent from the reference genome. We developed STRling to efficiently count k-mers to recover informative reads and call expansions at known and novel STR loci. STRling is sensitive to known STR disease loci, has a low false discovery rate, and resolves novel STR expansions to base-pair position accuracy. It is fast, scalable, open-source, and available at github.com/quinlan-lab/STRling.
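
To illustrate the general idea of counting k-mers to flag reads with high STR content, a minimal Python sketch follows. It is not STRling's implementation: the function names, the choice to collapse rotations of a k-mer into one canonical form, and the "fraction of windows" score are all assumptions made for illustration. Reverse complements are not handled.

```python
from collections import Counter


def canonical_rotation(kmer: str) -> str:
    """Return the lexicographically smallest rotation of a k-mer.

    Hypothetical helper: CAG, AGC and GCA all describe the same
    tandem repeat, so they are collapsed into one representative.
    """
    return min(kmer[i:] + kmer[:i] for i in range(len(kmer)))


def dominant_repeat_unit(read: str, k: int):
    """Count every k-mer window in a read and report the most common one.

    Returns (unit, fraction), where fraction is the share of k-mer
    windows whose canonical rotation equals the reported unit.
    A fraction near 1.0 suggests the read is mostly one tandem repeat.
    """
    windows = len(read) - k + 1
    if windows <= 0:
        return None, 0.0
    counts = Counter(
        canonical_rotation(read[i:i + k]) for i in range(windows)
    )
    unit, n = counts.most_common(1)[0]
    return unit, n / windows


if __name__ == "__main__":
    # A read made entirely of CAG repeats versus a non-repetitive read.
    print(dominant_repeat_unit("CAG" * 10, 3))
    print(dominant_repeat_unit("ACGTTGCATCGGATCCTAGA", 3))
```

In this sketch, a read scoring above some chosen threshold would be kept as informative for a given repeat unit regardless of where, or whether, it aligned to the reference. The threshold itself is left to the reader and is not taken from the paper.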