Frontiers in Genetics (Apr 2023)

Predicting enhancer-promoter interaction based on epigenomic signals

  • Leqiong Zheng,
  • Leqiong Zheng,
  • Leqiong Zheng,
  • Li Liu,
  • Wen Zhu,
  • Wen Zhu,
  • Yijie Ding,
  • Fangxiang Wu

DOI
https://doi.org/10.3389/fgene.2023.1133775
Journal volume & issue
Vol. 14

Abstract

Read online

Introduction: The physical interactions between enhancers and promoters are often involved in gene transcriptional regulation. High tissue-specific enhancer-promoter interactions (EPIs) are responsible for the differential expression of genes. Experimental methods are time-consuming and labor-intensive in measuring EPIs. An alternative approach, machine learning, has been widely used to predict EPIs. However, most existing machine learning methods require a large number of functional genomic and epigenomic features as input, which limits the application to different cell lines.Methods: In this paper, we developed a random forest model, HARD (H3K27ac, ATAC-seq, RAD21, and Distance), to predict EPI using only four types of features.Results: Independent tests on a benchmark dataset showed that HARD outperforms other models with the fewest features.Discussion: Our results revealed that chromatin accessibility and the binding of cohesin are important for cell-line-specific EPIs. Furthermore, we trained the HARD model in the GM12878 cell line and performed testing in the HeLa cell line. The cross-cell-lines prediction also performs well, suggesting it has the potential to be applied to other cell lines.

Keywords