Establishing a GRU-GCN coordination-based prediction model for miRNA-disease associations

Kai-Cheng Chuang; Ping-Sung Cheng; Yu-Hung Tsai; Meng-Hsiun Tsai

doi:10.1186/s12863-024-01293-z

BMC Genomic Data (Jan 2025)

Affiliations

Kai-Cheng Chuang: Department of Life Sciences, National Chung Hsing University
Ping-Sung Cheng: Department of Management Information Systems, National Chung Hsing University
Yu-Hung Tsai: Department of Management Information Systems, National Chung Hsing University
Meng-Hsiun Tsai: Department of Management Information Systems, National Chung Hsing University

DOI: https://doi.org/10.1186/s12863-024-01293-z
Journal volume & issue: Vol. 26, no. 1
pp. 1 – 11

Abstract

Background: miRNAs (microRNAs) are endogenous RNAs 18 to 24 nucleotides long that play critical roles in gene regulation and disease progression. Although traditional wet-lab experiments provide direct evidence for miRNA-disease associations, they are often time-consuming, and their results are complicated to analyze with current bioinformatics tools. In recent years, machine learning (ML) and deep learning (DL) techniques have become powerful tools for analyzing large-scale biological data. Hence, developing a model to predict, identify, and rank connections between miRNAs and diseases can significantly enhance the precision and efficiency of investigating these relationships.

Results: In this study, we used miRNA-disease association data obtained from biotechnological experiments to develop a DL model for miRNA-disease associations. The datasets came from HMDD (the Human miRNA Disease Database). To improve prediction accuracy, we introduced two labeling strategies, weight-based and majority-based definitions, and labeled the associations under each. After preprocessing, the data were used to train a novel model that combines gated recurrent units (GRU) with a graph convolutional network (GCN) to predict the level of miRNA-disease associations. We classified the associations into three groups, “upregulated”, “downregulated” and “nonspecific”, by regression analysis and multiclass classification. This GRU-GCN coordinated model achieved a robust area under the curve (AUC) score of 0.8 on all datasets, demonstrating its efficacy in predicting potential miRNA-disease relationships.

Conclusions: By introducing innovative label-preprocessing methods, this study addressed the relationships between miRNAs and diseases and reduced the ambiguity among results from different experiments. Based on these refined label definitions, we developed a DL-based model to refine and predict associations between miRNAs and diseases. This model offers a valuable tool for complementing traditional experimental methods and enhancing our understanding of miRNA-related disease mechanisms.
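
The abstract names the weight-based and majority-based labeling strategies but does not define them. As a rough illustration only, the Python sketch below shows one plausible reading of how the two could differ for a single miRNA-disease pair with conflicting experimental records. Everything in it is an assumption rather than the authors' method: the function names, the "up"/"down" record encoding, the tie rule, the signed score, and the 0.5 threshold.

```python
from collections import Counter

# Hypothetical encoding: each experimental record for one miRNA-disease
# pair is reduced to the string "up" or "down". This is an assumption,
# not the format used by HMDD or by the paper.

def majority_label(evidence):
    """Assumed majority rule: the more frequent direction wins; ties are nonspecific."""
    counts = Counter(evidence)
    if counts["up"] > counts["down"]:
        return "upregulated"
    if counts["down"] > counts["up"]:
        return "downregulated"
    return "nonspecific"

def weight_score(evidence):
    """Assumed signed score in [-1, 1]: +1 if all records say "up", -1 if all say "down"."""
    if not evidence:
        return 0.0
    counts = Counter(evidence)
    return (counts["up"] - counts["down"]) / len(evidence)

def weight_label(evidence, threshold=0.5):
    """Assumed weight rule: a direction is assigned only when the score is decisive."""
    score = weight_score(evidence)
    if score >= threshold:
        return "upregulated"
    if score <= -threshold:
        return "downregulated"
    return "nonspecific"

records = ["up", "up", "down"]   # three records that disagree
print(majority_label(records))   # simple vote
print(weight_label(records))     # thresholded score
```

Under these assumptions, a pair with two "up" records and one "down" record is "upregulated" by simple vote but "nonspecific" by the thresholded score, which is the kind of ambiguity between experiments that the label preprocessing is meant to handle.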

Published in BMC Genomic Data

ISSN: 2730-6844 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Genetics
Website: https://bmcgenomdata.biomedcentral.com/
