F1000Research (Jun 2019)

A step-by-step guide to analyzing CAGE data using R/Bioconductor [version 1; peer review: 2 approved]

  • Malte Thodberg,
  • Albin Sandelin

DOI
https://doi.org/10.12688/f1000research.18456.1
Journal volume & issue
Vol. 8

Abstract

Cap Analysis of Gene Expression (CAGE) is one of the most popular 5'-end sequencing methods. In a single experiment, CAGE can be used to locate and quantify the expression of both Transcription Start Sites (TSSs) and enhancers. This workflow is a case study on how to use the CAGEfightR package to orchestrate analysis of CAGE data within the Bioconductor project. Starting from BigWig files, it covers basic CAGE analyses such as identifying, quantifying and annotating TSSs and enhancers; advanced analyses such as finding interacting TSS-enhancer pairs and enhancer clusters; and differential expression analysis and analysis of alternative TSS usage. R code, discussion and references are interleaved to provide guidelines for future CAGE studies of the same kind.