Stats (Sep 2024)

A Statistical Methodology for Evaluating Asymmetry after Normalization with Application to Genomic Data

  • Víctor Leiva,
  • Jimmy Corzo,
  • Myrian E. Vergara,
  • Raydonal Ospina,
  • Cecilia Castro

DOI
https://doi.org/10.3390/stats7030059
Journal volume & issue
Vol. 7, no. 3
pp. 967 – 983

Abstract

Read online

This study evaluates the symmetry of data distributions after normalization, focusing on various statistical tests, including a few explored test named Rp. We apply normalization techniques, such as variance stabilizing transformations, to ribonucleic acid sequencing data with varying sample sizes to assess their effectiveness in achieving symmetric data distributions. Our findings reveal that while normalization generally induces symmetry, some samples retain asymmetric distributions, challenging the conventional assumption of post-normalization symmetry. The Rp test, in particular, shows superior performance when there are variations in sample size and data distribution, making it a preferred tool for assessing symmetry when applied to genomic data. This finding underscores the importance of validating symmetry assumptions during data normalization, especially in genomic data, as overlooked asymmetries can lead to potential inaccuracies in downstream analyses. We analyze postmortem lateral temporal lobe samples to explore normal aging and Alzheimer’s disease, highlighting the critical role of symmetry testing in the accurate interpretation of genomic data.

Keywords