Nanomaterials (May 2020)

Transcriptomics in Toxicogenomics, Part II: Preprocessing and Differential Expression Analysis for High Quality Data

  • Antonio Federico,
  • Angela Serra,
  • My Kieu Ha,
  • Pekka Kohonen,
  • Jang-Sik Choi,
  • Irene Liampa,
  • Penny Nymark,
  • Natasha Sanabria,
  • Luca Cattelani,
  • Michele Fratello,
  • Pia Anneli Sofia Kinaret,
  • Karolina Jagiello,
  • Tomasz Puzyn,
  • Georgia Melagraki,
  • Mary Gulumian,
  • Antreas Afantitis,
  • Haralambos Sarimveis,
  • Tae-Hyun Yoon,
  • Roland Grafström,
  • Dario Greco

DOI
https://doi.org/10.3390/nano10050903
Journal volume & issue
Vol. 10, no. 5
p. 903

Abstract

Read online

Preprocessing of transcriptomics data plays a pivotal role in the development of toxicogenomics-driven tools for chemical toxicity assessment. The generation and exploitation of large volumes of molecular profiles, following an appropriate experimental design, allows the employment of toxicogenomics (TGx) approaches for a thorough characterisation of the mechanism of action (MOA) of different compounds. To date, a plethora of data preprocessing methodologies have been suggested. However, in most cases, building the optimal analytical workflow is not straightforward. A careful selection of the right tools must be carried out, since it will affect the downstream analyses and modelling approaches. Transcriptomics data preprocessing spans across multiple steps such as quality check, filtering, normalization, batch effect detection and correction. Currently, there is a lack of standard guidelines for data preprocessing in the TGx field. Defining the optimal tools and procedures to be employed in the transcriptomics data preprocessing will lead to the generation of homogeneous and unbiased data, allowing the development of more reliable, robust and accurate predictive models. In this review, we outline methods for the preprocessing of three main transcriptomic technologies including microarray, bulk RNA-Sequencing (RNA-Seq), and single cell RNA-Sequencing (scRNA-Seq). Moreover, we discuss the most common methods for the identification of differentially expressed genes and to perform a functional enrichment analysis. This review is the second part of a three-article series on Transcriptomics in Toxicogenomics.

Keywords