Local Alignment of DNA Sequence Based on Deep Reinforcement Learning

Yong-Joon Song; Dong-Ho Cho

doi:10.1109/OJEMB.2021.3076156

IEEE Open Journal of Engineering in Medicine and Biology (Jan 2021)

Local Alignment of DNA Sequence Based on Deep Reinforcement Learning

Yong-Joon Song,
Dong-Ho Cho

Affiliations

Yong-Joon Song: ORCiD; School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Dong-Ho Cho: ORCiD; School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea

DOI: https://doi.org/10.1109/OJEMB.2021.3076156
Journal volume & issue: Vol. 2
pp. 170 – 178

Abstract

Read online

Goal: Over the decades, there have been improvements in the sequence alignment algorithm, with significant advances in various aspects such as complexity and accuracy. However, human-defined algorithms have an explicit limitation in view of developmental completeness. This paper introduces a novel local alignment method to obtain optimal sequence alignment based on reinforcement learning. Methods: There is a DQNalign algorithm that learns and performs sequence alignment through deep reinforcement learning. This paper proposes a DQN x-drop algorithm that performs local alignment without human intervention by combining the x-drop algorithm with this DQNalign algorithm. The proposed algorithm performs local alignment by repeatedly observing the subsequences and selecting the next alignment direction until the x-drop algorithm terminates the DQNalign algorithm. This proposed algorithm has an advantage in view of linear computational complexity compared to conventional local alignment algorithms. Results: This paper compares alignment performance (coverage and identity) and complexity for a fair comparison between the proposed DQN x-drop algorithm and the conventional greedy x-drop algorithm. Firstly, we prove the proposed algorithm's superiority by comparing the two algorithms’ computational complexity through numerical analysis. After that, we tested the alignment performance actual HEV and E.coli sequence datasets. The proposed method shows the comparable identity and coverage performance to the conventional alignment method while having linear complexity for the $X$ parameter. Conclusions: Through this study, it was possible to confirm the possibility of a new local alignment algorithm that minimizes computational complexity without human intervention.

Published in IEEE Open Journal of Engineering in Medicine and Biology

ISSN: 2644-1276 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Medicine (General): Medical technology
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8782705

About the journal

Abstract

Keywords