PLoS Computational Biology (Feb 2019)

Graph Peak Caller: Calling ChIP-seq peaks on graph-based reference genomes.

  • Ivar Grytten,
  • Knut D Rand,
  • Alexander J Nederbragt,
  • Geir O Storvik,
  • Ingrid K Glad,
  • Geir K Sandve

DOI
https://doi.org/10.1371/journal.pcbi.1006731
Journal volume & issue
Vol. 15, no. 2
p. e1006731

Abstract

Read online

Graph-based representations are considered to be the future for reference genomes, as they allow integrated representation of the steadily increasing data on individual variation. Currently available tools allow de novo assembly of graph-based reference genomes, alignment of new read sets to the graph representation as well as certain analyses like variant calling and haplotyping. We here present a first method for calling ChIP-Seq peaks on read data aligned to a graph-based reference genome. The method is a graph generalization of the peak caller MACS2, and is implemented in an open source tool, Graph Peak Caller. By using the existing tool vg to build a pan-genome of Arabidopsis thaliana, we validate our approach by showing that Graph Peak Caller with a pan-genome reference graph can trace variants within peaks that are not part of the linear reference genome, and find peaks that in general are more motif-enriched than those found by MACS2.