Applied Sciences (Jun 2024)

Exhaustive Variant Interaction Analysis Using Multifactor Dimensionality Reduction

  • Gonzalo Gómez-Sánchez,
  • Lorena Alonso,
  • Miguel Ángel Pérez,
  • Ignasi Morán,
  • David Torrents,
  • Josep Ll. Berral

DOI
https://doi.org/10.3390/app14125136
Journal volume & issue
Vol. 14, no. 12, p. 5136

Abstract

One of the main goals of human genetics is to understand how genomic variation relates to the predisposition to develop a complex disorder. These disease–variant associations are usually studied one variant at a time, disregarding any effect that arises from interactions between genomic variants. In complex diseases in particular, such interactions can be directly linked to the disorder and may play an important role in its development. Although studying them has been proposed as a way to complete our understanding of the genetic bases of complex diseases, doing so remains a major challenge because of the large computing demands involved. Here, we take advantage of high-performance computing technologies to tackle this problem, combining machine learning methods with statistical approaches. As a result, we created a containerized framework that uses multifactor dimensionality reduction (MDR) to detect pairs of variants associated with type 2 diabetes (T2D). This methodology was tested on the Northwestern University NUgene project cohort using a dataset of 1,883,192 variant pairs that show some degree of association with T2D. Out of the pairs studied, we identified 104 significant pairs, two of which exhibit a potential functional relationship with T2D. These results establish the proposed MDR method as a valid, efficient, and portable solution for studying variant interactions in real, reduced genomic datasets.
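
For readers unfamiliar with MDR, the core step for a single variant pair can be sketched as follows. This is a minimal illustration of the classic MDR idea, not the framework described in the abstract, whose implementation details are not given here. It assumes genotypes are coded 0, 1, 2 and case/control status is coded 1/0. The function name `mdr_pair` and the toy data are hypothetical, and the cross-validation and significance testing that a full analysis would need are omitted.

```python
from collections import Counter


def mdr_pair(geno_a, geno_b, status):
    """Classic MDR step for one variant pair (illustrative sketch only).

    geno_a, geno_b: genotype codes (assumed 0, 1, 2), one per sample.
    status: 1 for case, 0 for control (assumed coding).
    Returns (high_risk_cells, balanced_accuracy).
    """
    cases = Counter()
    controls = Counter()
    for a, b, s in zip(geno_a, geno_b, status):
        if s == 1:
            cases[(a, b)] += 1
        else:
            controls[(a, b)] += 1

    n_cases = sum(cases.values())
    n_controls = sum(controls.values())
    if n_cases == 0 or n_controls == 0:
        raise ValueError("need at least one case and one control")

    # A genotype cell is labelled high risk when its case:control ratio
    # exceeds the overall ratio. The comparison is cross-multiplied so that
    # cells with zero controls do not cause a division by zero.
    high_risk = {
        cell
        for cell in set(cases) | set(controls)
        if cases[cell] * n_controls > controls[cell] * n_cases
    }

    # The two-variant genotype grid is now reduced to one binary attribute
    # (high risk vs. low risk), scored here by balanced accuracy.
    true_pos = sum(cases[c] for c in high_risk)
    true_neg = sum(n for c, n in controls.items() if c not in high_risk)
    sensitivity = true_pos / n_cases
    specificity = true_neg / n_controls
    return high_risk, (sensitivity + specificity) / 2


# Toy data (made up for illustration): six samples, two variants.
toy_a = [0, 0, 1, 1, 2, 2]
toy_b = [0, 0, 1, 1, 2, 2]
toy_status = [1, 1, 0, 0, 1, 0]
cells, score = mdr_pair(toy_a, toy_b, toy_status)
```

An exhaustive pairwise analysis repeats this step for every candidate pair, which is why the number of pairs drives the computing demands mentioned in the abstract.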

Keywords