Frontiers in Genetics (Mar 2023)

RNA-seq data science: From raw data to effective interpretation

  • Dhrithi Deshpande,
  • Karishma Chhugani,
  • Yutong Chang,
  • Aaron Karlsberg,
  • Caitlin Loeffler,
  • Jinyang Zhang,
  • Agata Muszyńska,
  • Agata Muszyńska,
  • Viorel Munteanu,
  • Harry Yang,
  • Jeremy Rotman,
  • Laura Tao,
  • Brunilda Balliu,
  • Elizabeth Tseng,
  • Eleazar Eskin,
  • Eleazar Eskin,
  • Eleazar Eskin,
  • Fangqing Zhao,
  • Fangqing Zhao,
  • Pejman Mohammadi,
  • Paweł P. Łabaj,
  • Paweł P. Łabaj,
  • Serghei Mangul,
  • Serghei Mangul

DOI
https://doi.org/10.3389/fgene.2023.997383
Journal volume & issue
Vol. 14

Abstract

Read online

RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.

Keywords