Genome Biology (Jan 2020)

A curated benchmark of enhancer-gene interactions for evaluating enhancer-target gene prediction methods

  • Jill E. Moore,
  • Henry E. Pratt,
  • Michael J. Purcaro,
  • Zhiping Weng

DOI
https://doi.org/10.1186/s13059-019-1924-8
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Background Many genome-wide collections of candidate cis-regulatory elements (cCREs) have been defined using genomic and epigenomic data, but it remains a major challenge to connect these elements to their target genes. Results To facilitate the development of computational methods for predicting target genes, we develop a Benchmark of candidate Enhancer-Gene Interactions (BENGI) by integrating the recently developed Registry of cCREs with experimentally derived genomic interactions. We use BENGI to test several published computational methods for linking enhancers with genes, including signal correlation and the TargetFinder and PEP supervised learning methods. We find that while TargetFinder is the best-performing method, it is only modestly better than a baseline distance method for most benchmark datasets when trained and tested with the same cell type and that TargetFinder often does not outperform the distance method when applied across cell types. Conclusions Our results suggest that current computational methods need to be improved and that BENGI presents a useful framework for method development and testing.

Keywords