Genome Biology (Dec 2018)

miRTrace reveals the organismal origins of microRNA sequencing data

  • Wenjing Kang,
  • Yrin Eldfjell,
  • Bastian Fromm,
  • Xavier Estivill,
  • Inna Biryukova,
  • Marc R. Friedländer

DOI
https://doi.org/10.1186/s13059-018-1588-9
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 15

Abstract

Read online

Abstract We present here miRTrace, the first algorithm to trace microRNA sequencing data back to their taxonomic origins. This is a challenge with profound implications for forensics, parasitology, food control, and research settings where cross-contamination can compromise results. miRTrace accurately (> 99%) assigns real and simulated data to 14 important animal and plant groups, sensitively detects parasitic infection in mammals, and discovers the primate origin of single cells. Applying our algorithm to over 700 public datasets, we find evidence that over 7% are cross-contaminated and present a novel solution to clean these computationally, even after sequencing has occurred. miRTrace is freely available at https://github.com/friedlanderlab/mirtrace.