BMC Bioinformatics (Feb 2020)

SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups

  • Celine Everaert,
  • Pieter-Jan Volders,
  • Annelien Morlion,
  • Olivier Thas,
  • Pieter Mestdagh

DOI
https://doi.org/10.1186/s12859-020-3407-z
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background To understand biology and differences among various tissues or cell types, one typically searches for molecular features that display characteristic abundance patterns. Several specificity metrics have been introduced to identify tissue-specific molecular features, but these either require an equal number of replicates per tissue or they can’t handle replicates at all. Results We describe a non-parametric specificity score that is compatible with unequal sample group sizes. To demonstrate its usefulness, the specificity score was calculated on all GTEx samples, detecting known and novel tissue-specific genes. A webtool was developed to browse these results for genes or tissues of interest. An example python implementation of SPECS is available at https://github.com/celineeveraert/SPECS. The precalculated SPECS results on the GTEx data are available through a user-friendly browser at specs.cmgg.be. Conclusions SPECS is a non-parametric method that identifies known and novel specific-expressed genes. In addition, SPECS could be adopted for other features and applications.

Keywords