Genome Biology (Jun 2022)

scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously

  • Ziqi Zhang,
  • Chengkai Yang,
  • Xiuwei Zhang

DOI
https://doi.org/10.1186/s13059-022-02706-x
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 28

Abstract

Read online

Abstract It is a challenging task to integrate scRNA-seq and scATAC-seq data obtained from different batches. Existing methods tend to use a pre-defined gene activity matrix to convert the scATAC-seq data into scRNA-seq data. The pre-defined gene activity matrix is often of low quality and does not reflect the dataset-specific relationship between the two data modalities. We propose scDART, a deep learning framework that integrates scRNA-seq and scATAC-seq data and learns cross-modalities relationships simultaneously. Specifically, the design of scDART allows it to preserve cell trajectories in continuous cell populations and can be applied to trajectory inference on integrated data.

Keywords