Genome Biology (Jul 2023)

Identifying and quantifying isoforms from accurate full-length transcriptome sequencing reads with Mandalorion

  • Roger Volden,
  • Kayla D. Schimke,
  • Ashley Byrne,
  • Danilo Dubocanin,
  • Matthew Adams,
  • Christopher Vollmers

DOI
https://doi.org/10.1186/s13059-023-02999-6
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 15

Abstract

Read online

Abstract In this manuscript, we introduce and benchmark Mandalorion v4.1 for the identification and quantification of full-length transcriptome sequencing reads. It further improves upon the already strong performance of Mandalorion v3.6 used in the LRGASP consortium challenge. By processing real and simulated data, we show three main features of Mandalorion: first, Mandalorion-based isoform identification has very high precision and maintains high recall even in the absence of any genome annotation. Second, isoform read counts as quantified by Mandalorion show a high correlation with simulated read counts. Third, isoforms identified by Mandalorion closely reflect the full-length transcriptome sequencing data sets they are based on.