Transcriptomics in Toxicogenomics, Part II: Preprocessing and Differential Expression Analysis for High Quality Data

Antonio Federico; Angela Serra; My Kieu Ha; Pekka Kohonen; Jang-Sik Choi; Irene Liampa; Penny Nymark; Natasha Sanabria; Luca Cattelani; Michele Fratello; Pia Anneli Sofia Kinaret; Karolina Jagiello; Tomasz Puzyn; Georgia Melagraki; Mary Gulumian; Antreas Afantitis; Haralambos Sarimveis; Tae-Hyun Yoon; Roland Grafström; Dario Greco

doi:10.3390/nano10050903

Nanomaterials (May 2020)

Transcriptomics in Toxicogenomics, Part II: Preprocessing and Differential Expression Analysis for High Quality Data

Antonio Federico,
Angela Serra,
My Kieu Ha,
Pekka Kohonen,
Jang-Sik Choi,
Irene Liampa,
Penny Nymark,
Natasha Sanabria,
Luca Cattelani,
Michele Fratello,
Pia Anneli Sofia Kinaret,
Karolina Jagiello,
Tomasz Puzyn,
Georgia Melagraki,
Mary Gulumian,
Antreas Afantitis,
Haralambos Sarimveis,
Tae-Hyun Yoon,
Roland Grafström,
Dario Greco

Affiliations

Antonio Federico: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland
Angela Serra: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland
My Kieu Ha: Center for Next Generation Cytometry, Hanyang University, Seoul 04763, Korea
Pekka Kohonen: Institute of Environmental Medicine, Karolinska Institutet, 171 77 Stockholm, Sweden
Jang-Sik Choi: Center for Next Generation Cytometry, Hanyang University, Seoul 04763, Korea
Irene Liampa: School of Chemical Engineering, National Technical University of Athens, 157 80 Athens, Greece
Penny Nymark: Institute of Environmental Medicine, Karolinska Institutet, 171 77 Stockholm, Sweden
Natasha Sanabria: National Institute for Occupational Health, Johannesburg 30333, South Africa
Luca Cattelani: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland
Michele Fratello: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland
Pia Anneli Sofia Kinaret: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland
Karolina Jagiello: QSAR Lab Ltd., Aleja Grunwaldzka 190/102, 80-266 Gdansk, Poland
Tomasz Puzyn: QSAR Lab Ltd., Aleja Grunwaldzka 190/102, 80-266 Gdansk, Poland
Georgia Melagraki: Nanoinformatics Department, NovaMechanics Ltd., Nicosia 1065, Cyprus
Mary Gulumian: National Institute for Occupational Health, Johannesburg 30333, South Africa
Antreas Afantitis: Nanoinformatics Department, NovaMechanics Ltd., Nicosia 1065, Cyprus
Haralambos Sarimveis: School of Chemical Engineering, National Technical University of Athens, 157 80 Athens, Greece
Tae-Hyun Yoon: Center for Next Generation Cytometry, Hanyang University, Seoul 04763, Korea
Roland Grafström: Institute of Environmental Medicine, Karolinska Institutet, 171 77 Stockholm, Sweden
Dario Greco: Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland

DOI: https://doi.org/10.3390/nano10050903
Journal volume & issue: Vol. 10, no. 5
p. 903

Abstract

Read online

Preprocessing of transcriptomics data plays a pivotal role in the development of toxicogenomics-driven tools for chemical toxicity assessment. The generation and exploitation of large volumes of molecular profiles, following an appropriate experimental design, allows the employment of toxicogenomics (TGx) approaches for a thorough characterisation of the mechanism of action (MOA) of different compounds. To date, a plethora of data preprocessing methodologies have been suggested. However, in most cases, building the optimal analytical workflow is not straightforward. A careful selection of the right tools must be carried out, since it will affect the downstream analyses and modelling approaches. Transcriptomics data preprocessing spans across multiple steps such as quality check, filtering, normalization, batch effect detection and correction. Currently, there is a lack of standard guidelines for data preprocessing in the TGx field. Defining the optimal tools and procedures to be employed in the transcriptomics data preprocessing will lead to the generation of homogeneous and unbiased data, allowing the development of more reliable, robust and accurate predictive models. In this review, we outline methods for the preprocessing of three main transcriptomic technologies including microarray, bulk RNA-Sequencing (RNA-Seq), and single cell RNA-Sequencing (scRNA-Seq). Moreover, we discuss the most common methods for the identification of differentially expressed genes and to perform a functional enrichment analysis. This review is the second part of a three-article series on Transcriptomics in Toxicogenomics.

Published in Nanomaterials

ISSN: 2079-4991 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Chemistry
Website: https://www.mdpi.com/journal/nanomaterials

About the journal

Abstract

Keywords