BMC Genomics (Dec 2012)

Combining multiple ChIP-seq peak detection systems using combinatorial fusion

  • Schweikert Christina,
  • Brown Stuart,
  • Tang Zuojian,
  • Smith Phillip R,
  • Hsu D Frank

DOI
https://doi.org/10.1186/1471-2164-13-S8-S12
Journal volume & issue
Vol. 13, no. Suppl 8
p. S12

Abstract

Read online

Abstract Background Due to the recent rapid development in ChIP-seq technologies, which uses high-throughput next-generation DNA sequencing to identify the targets of Chromatin Immunoprecipitation, there is an increasing amount of sequencing data being generated that provides us with greater opportunity to analyze genome-wide protein-DNA interactions. In particular, we are interested in evaluating and enhancing computational and statistical techniques for locating protein binding sites. Many peak detection systems have been developed; in this study, we utilize the following six: CisGenome, MACS, PeakSeq, QuEST, SISSRs, and TRLocator. Results We define two methods to merge and rescore the regions of two peak detection systems and analyze the performance based on average precision and coverage of transcription start sites. The results indicate that ChIP-seq peak detection can be improved by fusion using score or rank combination. Conclusion Our method of combination and fusion analysis would provide a means for generic assessment of available technologies and systems and assist researchers in choosing an appropriate system (or fusion method) for analyzing ChIP-seq data. This analysis offers an alternate approach for increasing true positive rates, while decreasing false positive rates and hence improving the ChIP-seq peak identification process.