Genome Biology (Apr 2023)

PAUSE: principled feature attribution for unsupervised gene expression analysis

  • Joseph D. Janizek,
  • Anna Spiro,
  • Safiye Celik,
  • Ben W. Blue,
  • John C. Russell,
  • Ting-I Lee,
  • Matt Kaeberlin,
  • Su-In Lee

DOI
https://doi.org/10.1186/s13059-023-02901-4
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 30

Abstract

Read online

Abstract As interest in using unsupervised deep learning models to analyze gene expression data has grown, an increasing number of methods have been developed to make these models more interpretable. These methods can be separated into two groups: post hoc analyses of black box models through feature attribution methods and approaches to build inherently interpretable models through biologically-constrained architectures. We argue that these approaches are not mutually exclusive, but can in fact be usefully combined. We propose PAUSE ( https://github.com/suinleelab/PAUSE ), an unsupervised pathway attribution method that identifies major sources of transcriptomic variation when combined with biologically-constrained neural network models.

Keywords