PLoS Computational Biology (Feb 2020)

DeepHiC: A generative adversarial network for enhancing Hi-C data resolution.

  • Hao Hong,
  • Shuai Jiang,
  • Hao Li,
  • Guifang Du,
  • Yu Sun,
  • Huan Tao,
  • Cheng Quan,
  • Chenghui Zhao,
  • Ruijiang Li,
  • Wanying Li,
  • Xiaoyao Yin,
  • Yangchen Huang,
  • Cheng Li,
  • Hebing Chen,
  • Xiaochen Bo

DOI
https://doi.org/10.1371/journal.pcbi.1007287
Journal volume & issue
Vol. 16, no. 2
p. e1007287

Abstract

Read online

Hi-C is commonly used to study three-dimensional genome organization. However, due to the high sequencing cost and technical constraints, the resolution of most Hi-C datasets is coarse, resulting in a loss of information and biological interpretability. Here we develop DeepHiC, a generative adversarial network, to predict high-resolution Hi-C contact maps from low-coverage sequencing data. We demonstrated that DeepHiC is capable of reproducing high-resolution Hi-C data from as few as 1% downsampled reads. Empowered by adversarial training, our method can restore fine-grained details similar to those in high-resolution Hi-C matrices, boosting accuracy in chromatin loops identification and TADs detection, and outperforms the state-of-the-art methods in accuracy of prediction. Finally, application of DeepHiC to Hi-C data on mouse embryonic development can facilitate chromatin loop detection. We develop a web-based tool (DeepHiC, http://sysomics.com/deephic) that allows researchers to enhance their own Hi-C data with just a few clicks.