Using Attribution Sequence Alignment to Interpret Deep Learning Models for miRNA Binding Site Prediction

Katarína Grešová; Ondřej Vaculík; Panagiotis Alexiou

doi:10.3390/biology12030369

Biology (Feb 2023)

Using Attribution Sequence Alignment to Interpret Deep Learning Models for miRNA Binding Site Prediction

Katarína Grešová,
Ondřej Vaculík,
Panagiotis Alexiou

Affiliations

Katarína Grešová: Central European Institute of Technology (CEITEC), Masaryk University, 625 00 Brno, Czech Republic
Ondřej Vaculík: Central European Institute of Technology (CEITEC), Masaryk University, 625 00 Brno, Czech Republic
Panagiotis Alexiou: Central European Institute of Technology (CEITEC), Masaryk University, 625 00 Brno, Czech Republic

DOI: https://doi.org/10.3390/biology12030369
Journal volume & issue: Vol. 12, no. 3
p. 369

Abstract

Read online

MicroRNAs (miRNAs) are small non-coding RNAs that play a central role in the post-transcriptional regulation of biological processes. miRNAs regulate transcripts through direct binding involving the Argonaute protein family. The exact rules of binding are not known, and several in silico miRNA target prediction methods have been developed to date. Deep learning has recently revolutionized miRNA target prediction. However, the higher predictive power comes with a decreased ability to interpret increasingly complex models. Here, we present a novel interpretation technique, called attribution sequence alignment, for miRNA target site prediction models that can interpret such deep learning models on a two-dimensional representation of miRNA and putative target sequence. Our method produces a human readable visual representation of miRNA:target interactions and can be used as a proxy for the further interpretation of biological concepts learned by the neural network. We demonstrate applications of this method in the clustering of experimental data into binding classes, as well as using the method to narrow down predicted miRNA binding sites on long transcript sequences. Importantly, the presented method works with any neural network model trained on a two-dimensional representation of interactions and can be easily extended to further domains such as protein–protein interactions.

Published in Biology

ISSN: 2079-7737 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Biology (General)
Website: https://www.mdpi.com/journal/biology

About the journal

Abstract

Keywords