Exploring Reinforcement Learning Methods for Multiple Sequence Alignment: A Brief Review

Gaad Chaimaa; Chadi Mohamed-Amine; Sraitih Mohamed; Aamouche Ahmed

doi:10.1051/bioconf/20237501004

BIO Web of Conferences (Jan 2023)

Exploring Reinforcement Learning Methods for Multiple Sequence Alignment: A Brief Review

Gaad Chaimaa,
Chadi Mohamed-Amine,
Sraitih Mohamed,
Aamouche Ahmed

Affiliations

Gaad Chaimaa: LISA Laboratory, National School of Applied Sciences, University of Cadi Ayyad
Chadi Mohamed-Amine: LISI Laboratory, Computer science department, Faculty of Sciences Semlalia, University of Cadi Ayyad
Sraitih Mohamed: MSC Laboratory, National school of applied sciences, University of Cadi Ayyad
Aamouche Ahmed: LISA Laboratory, National School of Applied Sciences, University of Cadi Ayyad

DOI: https://doi.org/10.1051/bioconf/20237501004
Journal volume & issue: Vol. 75
p. 01004

Abstract

Read online

Multiple sequence alignment (MSA) plays a vital role in uncovering similarities among biological sequences such as DNA, RNA, or proteins, providing valuable information about their structural, functional, and evolutionary relationships. However, MSA is a computationally challenging problem, with complexity growing exponentially as the number and length of sequences increase. Currently, standard MSA tools like ClustalW, T-Coffee, and MAFFT, which are based on heuristic algorithms, are widely used but still face many challenges due to the combinatorial explosion. Recent advancements in MSA algorithms have employed reinforcement learning (RL), particularly deep reinforcement learning (DRL), and demonstrated optimized execution time and accuracy with promising results. This is because deep reinforcement learning algorithms update their search policies using gradient descent, instead of exploring the entire solution space making it significantly faster and efficient. In this article, we provide an overview of the recent historical advancements in MSA algorithms, highlighting RL models used to tackle the MSA problem and main challenges and opportunities in this regard.

Published in BIO Web of Conferences

ISSN: 2117-4458 (Online)
Publisher: EDP Sciences
Country of publisher: France
LCC subjects: Science: Microbiology; Science: Physiology; Science: Zoology
Website: http://www.bio-conferences.org

About the journal

Abstract

Keywords