Agrosystems, Geosciences & Environment (Dec 2023)

STAC: A tool to leverage genetic marker data for crop research and breeding

  • Scott Carle,
  • Alecia Kiszonas,
  • Kimberly Garland‐Campbell,
  • Craig F. Morris

DOI
https://doi.org/10.1002/agg2.20436
Journal volume & issue
Vol. 6, no. 4
pp. n/a – n/a

Abstract

Read online

Abstract As genotyping by sequencing (GBS) becomes more prevalent and cost‐effective, there is a benefit in being able to apply the data to solve a variety of problems. However, high degrees of missing data and overreliance on single nucleotide polymorphisms (SNPs), while ignoring other forms of genetic variation, frequently plague attempts to make full use of GBS sequence data. Here we have developed two R scripts to serve as a tool in haplotype determination at loci of interest within biparental populations. One of these scripts, Sparse Tag Allele Caller (STAC), provides both automated calling and visual representations of the data around a locus of interest to assist in rapid data compilation decision‐making. The other script, STAC Integrate, allows automated quality control and logic‐based integration of presence/absence data with SNP data, while also rendering global overviews of recombination and coverage across the genome. These scripts are designed to be used together to maximize the utility of the available data. These tools were validated on a biparental population of wheat that was genotyped through GBS. They successfully enabled haplotype determination of a locus that was difficult to directly genotype, and their systemic accuracy was demonstrated in multiple populations and species. These scripts may serve as a tool for researchers attempting to make better use of GBS and other genetic marker data for both research and crop breeding decisions.